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Abstract 

We prove a theorem giving the asymptotic number of binary quartic forms having bounded invariants; 
this extends, to the quartic case, the classical results of Gauss and Davenport in the quadratic and cubic 
cases, respectively. Our techniques are quite general, and may be applied to counting integer orbits in 
other representations of algebraic groups. As an example, we illustrate the method on a particularly rich 
nonreductive representation having dimension 12. 

We use these counting results to deduce a number of arithmetic consequences. First, we determine 
the mean number of 2-torsion elements in the class groups of monogenic maximal cubic orders. We 
similarly determine the mean number of 2-torsion elements in the class groups of maximal cubic orders 
having a monogenic subring of bounded index. Surprisingly, we find that these mean values are different! 
This demonstrates that, on average, the monogenicity of a ring has a direct altering effect on the behavior 
of the class group. 

Finally, we utilize all the above results to prove that the average rank of elliptic curves, when ordered 
by their heights, is bounded. In particular, we prove that when elliptic curves are ordered by height, the 
mean size of the 2-Selmer group is 3. This implies that the average rank of elliptic curves is at most 1.5. 
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1 Introduction 

1.1 Average ranks of elliptic curves 

Any elliptic curve E over Q is isomorphic to a unique curve of the form Ea.b '■ y 2 — x 3 + Ax + B, where 
A, B 6 Z and for all primes p: p 6 \ B whenever p 4 | A. Let H(Ea,b) denote the (naive) height of 
Ea,b, defined by H(Ea,b) '■— max{4|A 3 |, 27B 2 }. Let £\(Ea,b) and C(Ea i b) denote the discriminant and 
conductor of Ea,b, respectively. 

It is an old conjecture, originating in works of Goldfeld |H] and Katz-Sarnak [35], that a density of 
50% of all elliptic curves over Q have rank and 50% have rank 1. These densities are expected to hold true 
regardless of whether one orders curves by height, discriminant, or conductor. In particular, one expects the 
average rank of all elliptic curves to be 1/2. However, it has not previously been known that the average 
rank of all elliptic curves is even finite (i.e., bounded). 

In |14) . Brumer showed that the generalized Riemann hypothesis and the Birch-Swinnerton-Dyer 
conjectures together imply that the average rank of all elliptic curves, when ordered by their heights, is 
finite and is in fact bounded above by 2.3. Still assuming the generalized Riemann hypothesis and the 
Birch-Swinnerton-Dyer conjectures, this constant was subsequently improved to 2 by Heath-Brown |30) and 
to 25/14 - 1.79 by Young 

The purpose of this article is to prove unconditionally that the average rank of all elliptic curves, 
when ordered by their heights, is finite. In fact, we prove the same for the 2-Selmer rank. Recall that the 
2-Selmer group 82(E) of an elliptic curve fits into an exact sequence 

0^E(Q)/2E(Q)^S 2 (E)^UI E [2]^0, (1) 
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where Ills [2] denotes the 2-torsion subgroup of the Tate-Shafarevich group UIe of E. The 2-Selmer group 
is thus an elementary abelian 2-group, and its order is 2 s for some integer s > 0; the quantity s is called the 
2-Selmer rank of E. 

Our main theorem on the 2-Selmer group is as follows: 

Theorem 1.1 When all elliptic curves E/Q are ordered by height, the average size of the 2-Selmer group 
S 2 (E) is 3. 

We immediately conclude that: 

Corollary 1.2 When all elliptic curves over Q are ordered by height, their average 2-Selmer rank is at most 
1.5; thus their average rank is also at most 1.5. 

Indeed, note that equation ([1} implies that 

r 2 (S 2 (E)) = r(E) + r 2 {E[2]) + r 2 (HI E [2]) , (2) 

where we have used r(E) to denote the rank of E and r 2 {G), where G is an elementary abelian 2-group, 
to denote dhriF 2 (G). Theorem 11.11 bounds the left hand side of ([2]) by 1.5, and thus the same bound holds 
for the average size of each of the terms on the right hand side of <j2j) - In particular, the average size of 
^(Hlfip]) is also at most 1.5. Meanwhile, we will show in Section 5 that the mean size of r2(E[2\) is 0, i.e., 
0% of elliptic curves possess rational 2-torsion. 

We will in fact prove a stronger version of Theorem II .![ namely: 

Theorem 1.3 When elliptic curves E : y 2 — g{x), in any family defined by finitely many congruence 
conditions on the coefficients of g, are ordered by height, the average size of the 2-Selmer group 82(E) is 3. 

Thus the average size of the 2-Selmer group remains 3 even when one averages over any subset of elliptic 
curves defined by finitely many congruence conditions. We will actually prove Theorem ll.3l for an even larger 
class of families, including some that are defined by certain natural infinite sets of congruence conditions. 

1.2 Counting binary forms having bounded invariants (particularly quartic 
forms) 

We prove the above theorems by developing techniques to count integer orbits, having bounded invariants, 
in certain coregular representations over Z. Recall that a coregular representation is a pair (G, V), where 
G is an algebraic group and V is a representation of G (for our purposes, both defined over Z) such that 
the ring of polynomial invariants of G on V is free. Although our techniques are quite general, in this 
article we concentrate primarily on the case where G = GL 2 and V is the space of binary quartic forms 
ax A + bx 3 y + cx 2 y 2 + dx 3 y + ey A . 

The problem of counting integral binary forms having bounded invariants is a classical one. The 
case of binary quadratic forms was first treated in the influential work Disquisitiones Arithmeticae of Gauss 
in 1801. Gauss studied the action of SL2(Z) on the space of integral binary quadratic forms f(x,y) — 
ax 2 + bxy + cy 2 (a, 6,c£ zj]] via linear substitution of variable, in terms of the unique polynomial invariant 
for this action, namely the discriminant A(/) = b 2 — Aac. (The polynomial invariant A(/) is "unique" in 
the sense that the ring of polynomial invariants is generated by one element, namely A(/).) 

Gauss conjectured, and Mertens [36] and Siegel [39] proved, respectively, that: 

Theorem 1.4 (Gauss 1801/Mertens 1874/Siegel 1944) Lethn denote the number of 'SI^Z)- equivalence 
classes of irreducible integral binary quadratic forms having discriminant D. Then: 

(a) £ h D ~ I.XW; 

-X<D<0 

1 Gauss actually considered only forms where b is even; however, from the modern point of view, it is natural to allow all 
three coefficients a, b, c to be arbitrary integers. 
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(b) £ ^i og£D ~ y&-x 3/2 ; 

a<D<x 

here Ed = (t + uyD)/2, where t,u are the smallest positive integral solutions oft 2 — du 2 = 4. 

Note that hjj and log to have important algebraic number theoretic interpretations, namely, h(D) is the class 
number and log er> is the regulator of the unique quadratic order of discriminant D. Thus Theorem II .4f a) 
gives the average size of the class number of imaginary quadratic orders up to a given absolute discriminant, 
while (b) gives the average size of the class number times the regulator of real quadratic orders up to a given 
discriminant. 

The next natural case to consider is that of integral binary cubic forms f(x, y) — ax 3 + bx 2 y + cxy 2 + 
dy 3 (a, b,c,d £ Z). The group GL.2(Z) (or SLi2(Z)) again naturally acts on such forms, and there is again a 
unique polynomial invariant for this action, namely, the discriminant 

A(/) = b 2 c 2 + 18abcd - Aac 3 - Ab 3 d - 21 a 2 d 2 . 

The question, as in the case of binary quadratic forms, is: how many classes h(D) of irreducible binary cubic 
forms are there with discriminant D, on average, as D varies? 
This question was first answered by Davenport |23j : 

Theorem 1.5 (Davenport 1951) Leth(D) denote the number of GL/2(Z)- equivalence classes of irreducible 
integral binary cubic forms having discriminant D. Then: 

(a) £ h(D) ~ ~X; 

-X<D<0 

( b ) E h W ~ ^- X - 

0<D<X 

Davenports' theorem thus states that the number of equivalence classes of irreducible binary cubic forms 
per discriminant is a constant on average. This too has an important algebraic number theoretic interpreta- 
tion. Since irreducible binary cubic forms are in bijection with orders in cubic fields (see Delone-Faddeev's 
work [H]), Theorem 11.51 states that there are a constant number of (isomorphism classes) of cubic orders 
per discriminant, on average. Davenport's Theorem was an essential ingredient in the classical work of 
Davenport and Heilbronn on the density of discriminants of cubic fields (see [24]). 

The next natural case to consider is that of binary quartic forms. The group GL2(Z) again acts 
on the space of binary quartic forms f{x, y) = ax 4 + bx 3 y + cx 2 y 2 + dxy 3 + ey A (a, b, c,d,e £ Z) by linear 
substitution of variable. Note that in each of the cases of binary quadratic and binary cubic forms, the ring 
of invariants was generated by one element. Binary quartic forms historically have been more difficult to 
treat because the ring of invariants is now generated by two independent invariants, traditionally denoted I 
and J. For f(x,y) as above, we have the following explicit formulae for these invariants: 

/(/) = Viae- 3bd + c 2 , 

J {f) = 72ace + 9bcd ~ 27 'ad 2 ~ 21 eb 2 - 2c 3 . 

Any other polynomial invariant for the action of GL2(Z) on binary quartic forms can be expressed as a 
polynomial in these invariants; for example, the discriminant A(/) of a binary quartic form can be expressed 
in terms of /(/) and J(f) as follows: 

A(/) :=A(/,J) := (4/(/) 3 - J(/) 2 )/27. 

It follows from work of Borel and Harish- Chandra [13] that the number of equivalence classes of 
integral binary quartic forms, having any given fixed values of / and J (so long as both / and J are not 
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both equal to zero), is finite^ This raises the question as to how many classes h(I, J) of irreducible binary 
quartic forms with invariants /, J are there, on average, as the pair (/, J) varies? 

To answer this question, we require just a bit of notation. Let us define the (naive) height of f{x, y) 
by H(f) :— H(I,J) := max{|/ 3 |, J 2 /4} (the constant 1/4 on J 2 is present for convenience, and is not 
of any real importance). Thus H(f) is a "degree 6" function on the coefficients of /, in the sense that 
H(rf) — r 6 H(f) for any constant r. We prove: 

Theorem 1.6 Let h^(I.J) denote the number of GL2(Z)- equivalence classes of irreducible binary quartic 
forms having 4 — 2i real roots in P 1 and invariants equal to I and J. Then: 

(a) J2 ^ (0) ( 7 ' J ) = ^C(2)X B / 6 + 0(* 3/4+e ); 



H(I,J)<X 



(b) £ hW(I,J) = ^ C (2)X 5/6 + 0(X 3 /^): 



H{I,J)<X 



(c) 



£ h™{I,J) = ^C(2)* B/8 + 0(* 3/4+e ). 

H(I,J)<X 



In order to obtain the average size of hP\l,J), as (J, J) varies, we first wish to know which pairs 
(I, J) can actually occur as the invariants of an integral binary quartic form. In the quadratic and cubic 
cases, this is easy and well-known: a number occurs as the discriminant of a binary quadratic (resp. cubic) 
form if and only if it is congruent to or 1 (mod 4) . 

In the binary quartic case, we prove that a similar scenario occurs, namely, an (/, J) is eligible — i.e., 
it occurs as the invariants of some integer binary quartic form — if and only if it satisfies any one of a certain 
specified finite set of congruence conditions modulo 27. More precisely, we prove: 

Theorem 1.7 A pair (J, J) G Z x Z occurs as the invariants of an integral binary quartic form if and only 
if it satisfies one of the following congruence conditions: 

(a) 1 = (mod 3) and J = (mod 27), 

(b) 1 = 1 (mod 9) and J = ±2 (mod 27), 

(c) 1=4 (mod 9) and J = ±16 (mod 27), 

(d) 7=7 (mod 9) and J = ±7 (mod 27). 

It follows that the number of eligible (I, J), with H (I, J) < X , is a constant times X 5 / 6 ; thus, by 
Theorem 11.61 the number of classes of binary quartic forms per eligible (I, J) is a constant on average. We 
have the following theorem: 

Theorem 1.8 Let J) denote the number of GL2(Z)- equivalence classes of irreducible binary quartic 

forms having 4 — 2i real roots and invariants equal to I and J. Let uq = 4, ri\ = 2, and ri2 = 2. Then, for 
i = 0,1, 2, we have: 

Hm H(ij)<x 2C(2) 

(I, J) eligible 
(-1) I A(7,J)>0 
H(I,J)<X 



2 A deeper — and more ineffective — result of Birch and Merriman states that the number of equivalence classes of binary 
quartic forms, having a fixed value of the single invariant A(/) = 4J(/) 3 — J(/) 2 , is finite. However, the latter fact will not be 
of importance to us here. 
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Thus, Theorem 11.81 savs that the number of equivalence classes of binary quartic forms per eligible (I, J), 
having a given number of real roots, is a constant on average. This constant is either £(2)/2 or C(2), 
depending on whether the given number of real roots is 4 or less than 4, respectively. 

We in fact prove a strengthening of Theorem ll.81 namely, we obtain the asymptotic count of binary 
quartic forms, having bounded invariants, satisfying any specified finite set of congruence conditions. Such 
a modification will be crucial for several of the applications, which we now discuss. 

First, we use the latter counting results involving binary quartic forms to understand the average 
size of 2-Selmer groups (as in Theorem ll.lj) . Recall that an element of the 2-Selmer group of an elliptic 
curve E/Q may be thought of as a locally soluble 2-covering. A 2-covering of E/Q is a genus one curve C 
together with maps <j) : C — > E and 8 : C — > E, where (j) is an isomorphism defined over C, and 6 is a degree 
4 map defined over Q, such that the following diagram commutes: 




Thus a 2-covering C — (C, <fi, 9) may be viewed as a "twist over Q of the multiplication-by-2 map on . 
Two 2-coverings C and C are said to be isomorphic if there exists an isomorphism $ : C — > C defined over 
Q, and a rational 2-torsion point P £ E, such that the following diagram commutes: 

+p 
E >E 

e e' 



A soluble 2-covering C is one that possesses a rational point, while a locally soluble 2-covering C is one that 
possesses an R-point and a Q p -point for all primes p. Then we have, as groups, the isomorphisms: 

{soluble 2-coverings}/ - E(Q)/2E(Q); 
{locally soluble 2-coverings}/ ~ = 82(E). 

How does counting elements of 82(E) lead to counting binary quartic forms? There is a result of 
Birch and Swinnerton-Dyer (see [HI Lemma 2]) that states that any locally soluble 2-covering C posseses a 
degree 2 divisor defined over Q, thus yielding a double cover C — » P 1 ramified at 4 points. We thus obtain 
a binary quartic form over Q, well-defined up to GL2(Q)-equivalence! 

To carry out the proof of Theorems 11.11 and II. 3[ we do the following: 

• Given i,BeZ, choose an integral binary quartic form / for each element of S2(Ea,b), such that 

~ y 2 — f( x ) gives the desired 2-covering; 

— the invariants (/(/), </(/)) of / agree with the invariants (A, B) of the elliptic curve (at least away 
from 2 and 3); 

• Count these integral binary quartic forms via Theorem 1.6. The relevant binary quartic forms are 
defined by infinitely many congruence conditions, so a sieve has to be performed. 

• A uniformity estimate must be proven to perform this sieve, and that is by far the most technical part 
of this work. It involves counting integral points, having bounded invariants, in a certain nonreductive 
coregular space of dimension 12. 

To carry out the sieve/uniformity step in the arguments above, we require in particular some auxiliary 
results on the 2-class groups of monogenic cubic fields, which we believe are of interest in their own right. 
We describe these results next. 
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1.3 The mean size of the 2-class group of monogenic cubic fields 

We define a cubic ring to be a commutative ring with unit that is free of rank 3 as a Z-module. For 
example, an order in a cubic field is our prototypical cubic ring. We say that a cubic ring is monogenic if 
it is generated by a single element as a Z-algebra. A monogenized cubic ring is a pair (C, x), where C is a 
monogenic cubic ring and x <E C is an element that generates C over Z. In that case, C is isomorphic to the 
ring Z[X]/(X 3 + rX 2 + sX + t), where f(X) = X 3 + rX 2 + sX + t is the characteristic polynomial of the 
linear map xx : C — > C given by multiplication by x. 

If (C, x) is a monogenized cubic ring, then so is (C, x — n) for any n € Z. We thus define an 
equivalence relation ~ on monogenized cubic rings by (C, x) ~ (C, y) if and only if x — y G Z. This 
leads to a corresponding translation action of Z on monic cubic polynomials, namely n G Z sends /(X) to 
<?P0 = /(X + n), and 1[X]/ f(X) ~ Z[X]/ 5 (X). 

We show that this translation action of Z on monic cubic polynomials f(X) = X 3 + rX 2 + sX + t 
again has two polynomial invariants, having degrees 2 and 3 respectively, and which we again call /(/) and 
J(f). They are defined by 

1(f) := r 2 - 3s, 
J(f) := -2r 3 + 9rs - 27*. 

One easily checks that these quantities do not change if f(X) is replaced by f(X + n) for any n € Z. 
We define the (naive) height H{C,x) of the monogenized cubic ring (C,x) by H(C,x) := H(I(f) 1 J(f)) = 
max{|/(/)| 3 , J(/) 2 /4}, where / as before denotes the characteristic polynomial of xx : C — > C. 

We will be concerned primarily with monogenic cubic fields, i.e., cubic fields whose rings of integers 
are monogenic. We say that (K, x) is a monogenized cubic field if (0 k, x) is a monogenized cubic ring; here 
k denotes the ring of integers in K. In that case, we define the height of the monogenized cubic field (K, x) 
by H(K) :=H(0 K ,x). 

We now order all monogenized cubic fields K = (K, x) by their heights, and ask: what is the average 
size of the 2-torsion subgroups of the class groups of monogenized cubic fields? (It is known that a cubic 
field can be monogenized in at most 12 ways, up to equivalence; see [25] and [27].) We prove the following 
theorem: 

Theorem 1.9 For a cubic field K, let G\2(K) (resp. ClJ(-ftT)) denote the 2-torsion subgroup of the class 
group of K (resp. the narrow class group of K). Let K run through all isomorphism classes of monogenized 
cubic fields ordered by height. Then: 

(a) The average size of G\2(K) over totally real monogenized cubic fields K is 3/2; 

(b) The average size of G\2(K) over complex monogenized cubic fields K is 2; 

(c) The average size ofC\^(K) over totally real monogenized cubic fields K is 5/2. 

Moreover, these averages do not change even if K runs only over those monogenized cubic fields satisfying 
any specified set of local conditions at a finite set of primes. 

It is interesting to note that the averages in Theorem 11.91 are strictly larger than the corresponding 
averages for all cubic fields, when ordered by discriminant; indeed, the averages in the latter case are 5/4, 
3/2, and 2 respectively (see [5j Theorem 5]). This indicates that monogenic cubic fields tend to have more 
2-torsion in their class groups, on average, than general cubic fields! However, a priori, it is also possible 
that this discrepancy arises because we are ordering these two sets of cubic fields differently (the first by 
height, and the second by discriminant). 

We thus prove a result on the average number of 2-torsion elements in the class groups of general 
cubic fields, when they are ordered by height rather than discriminant. To make this precise, let us say that 
(C, x) is a submonogenized cubic ring of index n, or simply an n-monogenized cubic ring, if C is a cubic ring 
and x € C is an element such that x generates a subring of index n in C . Thus a 1-monogenized cubic ring 
is equivalent to a monogenized cubic ring. In general, an n-monogenized cubic ring is a cubic ring having an 
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index n subring that is monogenic, together with a choice of generator for this subring. Note that any cubic 
ring can be n-monogenized for some integer n > 1. 

We dehne the height H(C, x) of an n-monogenized cubic ring (C, x) by H(C, x) := n 2 H(I(f), J(f)) = 
n 2 max{|J(/)| 3 , J(/) 2 /4}, where / as before denotes the characteristic polynomial of xx : C —> C. (The 
factor of n 2 appears so that the height H(C,x) of (C,x) remains comparable with the discriminant A(C) 
of C.) 

Finally, an n-monogenized cubic field is a pair (K,x) for which (0k,x) is an n-monogenized cubic 
ring. We dehne the height H(K,x) of an n-monogenized cubic field (K,x) by H(K,x) := H(0k,x). 

We may now ask: what is the average size of the 2-torsion subgroup of the class group of n- 
monogenized cubic fields, over all isomorphism classes of n-monogenized fields having height less than X 
and n < X s ? We prove the following theorem: 

Theorem 1.10 For a cubic field K , let C^if) (resp. 01% (K)) denote the 2-torsion subgroup of the class 
group of K (resp. the narrow class group of K). Let 5 < 1/4 be any positive constant, and let S(X, 5) denote 
the set of all isomorphism classes of n-monogenized cubic fields K such that H(K) < X and n < X s . Then: 

(a) As X — > oo, the average size of G\.2(K) over totally real cubic fields K in S(X,S) approaches 5/4; 

(b) As X — > oo, the average size of Cl2{K) over complex cubic fields K in S[X,8) approaches 3/2; 

(c) As X — > oo, the average size ofCl^iK) over totally real cubic fields K in S(X,8) approaches 2. 

Moreover, these averages do not change even if K runs only over those n-monogenized cubic fields satisfying 
any specified set of local conditions at a finite set of primes. 

Theorem 11.101 makes it clear that, in a precise sense, the average size of the 2-torsion subgroups of 
class groups of monogenic cubic fields is higher than the corresponding average for general cubic fields! The 
theorem implies that the monogenicity of the ring of integers of a cubic field has a direct influence on the 
behaviour of the class group of the field. In particular, the class groups of monogenic cubic fields do not 
seem to be "random groups" in the sense of Cohen-Lenstra-Martinet [T7] and Malle [35] . 

It would be interesting to have heuristics that explain this phenomenon. In a forthcoming paper, 
in joint work with Jonathan Hanke, we describe the average size s(n) of C\2(K) over n-monogenic cubic 
fields K, where n is fixed; the base case n = 1 is Theorem 11.91 The function s(n) is found to vary in a 
very interesting way with n, and the average of s(n) over all n is then shown to yield the constants in 
Theorem [HOI see [§]. 

We prove Theorem 11.91 by reducing it to an appropriate count of binary quartic forms satisfying a 
certain natural infinite set of congruence conditions. The key observation in this context is that GL2(Z)- 
equivalence classes of integral binary quartic forms parametrize quartic rings having monogenic cubic re- 
solvent rings (in the sense of [4]); this is a theorem of Wood [41], Theorem 4.1.1]. When we restrict to the 
binary quartic forms yielding maximal cubic resolvent rings, then by class field theory, such binary quartic 
forms may then be seen to correspond to the 2-torsion elements in the dual of the class group of monogenic 
cubic fields. By counting these binary quartic forms, we obtain Theorem II. 91 

However, since the set of binary quartic forms that must be counted is defined by infinitely many 
congruence conditions, a sieve must be performed to obtain the correct count. In fact, it is essentially 
the same sieve required to average the number of elements in the 2-Selmer group of elliptic curves (as in 
Theorems O and OJ ! 

While the sieve that we use is very simple, the key ingredient that is required to perform the sieve 
is a certain uniformity estimate on the error, as more and more congrence conditions are introduced, that 
is quite challenging and technical to prove. This uniformity estimate is in some sense the analogue of the 
uniformity estimate in Davenport-Heilbronn's work |24[ Proposition 1] (indeed, this is perhaps the most 
technical part of their paper) and of those in J5J Proposition 23] and [HI Proposition 18]. 

The case at hand seems considerably more difficult. We prove the desired estimate by enlarging 
the space of binary quartic forms, equipped with the action of GL 2 (Z), to the space of pairs of ternary 
quadratic forms, equipped with the action of a certain non-reductive group; this corresponds to embedding 
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the parameter space of all monogenized cubic rings into that of all submonogenized cubic rings. The uni- 
formity estimate is then proven by counting orbits on this enlarged space, in conjunction with a class field 
theory argument that then applies in this context. Although this takes some effort, as a by-product and 
consequence of this work we end up proving Theorem 11.101 

This paper is organized as follows. In Section 2, we study the distribution of GL2 (Z)-equivalence 
classes of binary quartic forms with respect to their fundamental invariants / and J; in particular, we prove 
Theorems ll.6l and ll.8l In Section 3, we describe the correspondence between binary quartic forms and 2-class 
groups of monogenic cubic fields, by generalizing in a natural way the correspondence of Wood [41] . We use 
this to prove Theorem II. 91 (assuming the aforementioned uniformity estimates). 

In Section 4, we prove the uniformity estimates that are necessary to count binary quartic forms sat- 
isfying our desired infinite sets of congruence conditions. In particular, we complete the proof of Theorem ll.9[ 
and also obtain Theorem ll.101 

In Section 5, we conclude by describing the precise connection between binary quartic forms and 
elements in the 2-Selmer groups of elliptic curves. This connection allows us, through the use of certain 
mass formulae for elliptic curves over Q p , to compute the average size of the 2-Selmer groups of elliptic 
curves (or of appropriate families of elliptic curves) via a count of binary quartic forms satisfying a certain 
corresponding infinite set of congruence conditions. We then apply the uniformity results of Section 4 to 
count these binary quartic forms, thus completing the proofs of Theorems 11.11 and 11.31 



2 The number of integral binary quartic forms having bounded 
invariants 

In this section, we derive precise asymptotics for the number of GL 2 (Z)-equivalence classes of irreducible 
integral binary quartic forms having bounded invariants. We also describe how these asymptotics change 
when we restrict to counting only those binary quartic forms satisfying any specified finite set of congruence 
conditions. In particular, we prove Theorems 1 1 . 6lH~51 

To this end, let Vr denote the vector space of binary quartic forms over the real numbers R. We 
express an element / 6 Vr in the form f(x, y) = ax 4 + bx 3 y + cx 2 y 2 + dxy 3 + ey 4 , where a, b, c, d, and e are 
real numbers. Such an / 6 Vr is said to be integral if a, b, c, d, e G Z. 

The group GL/2(R) naturally acts on Vr; namely, an element 7 G GL2(R) acts on f(x,y) by linear 
substitution of variable: 

7 ' f(x,y) = f((x,y) ■ 7)- (3) 

This action of GL 2 (R) on Vr is a left action, i.e., (7172) • / = 7i • (72 • /)■ 

We consider the action of SL^ (R) on Vr , where SL^ (R) C GL 2 (R) is the group of elements in GL 2 (R) 
having determinant equal to ±1. The ring of invariants for this action is generated by two independent 
generators of degrees 2 and 3 which are traditionally denoted by I and J, respectively. If f(x,y) = ax + 
bx 3 y + cx 2 y 2 + dxy 3 + ey 4 , then 

/(/) = I2ae-3bd + c 2 , 

J {f) = 72ace + 9bcd - 27 ad 2 - 27eb 2 - 2c 3 . U 

The quantities /(/) and J(/) are also relative invariants for the action of GL 2 (R) on V: we have I("f ■ f) = 
(det 7) 4 7(/) and J(7 • /) = (det r y) 6 J(f). The discriminant A(/) of a binary quartic form /, being a relative 
invariant of degree 6, can thus be expressed in terms of I and J, namely, A(/) = (4/ 3 (/) — J 2 (/))/27. We 
define the height H(f) of a binary quartic form / by: 

H(f) := H(I, J) - max{|/| 3 , J 2 /4}. (5) 

The action of GL2(Z) on Vr evidently preserves the lattice V% consisting of the integral elements 
of Vr, and so we may ask: how many GL2(Z)-classes of forms are there having height at most XI More 
precisely, we may ask: how many GL2(Z)-classes of forms are there with height at most X and a given 
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number of real roots? To this end, for i = 0, 1, and 2, let V^ denote the set of elements in Vz having 4 — 2i 
real roots in P 1 (M) . Then the main theorem of this section is the following restatement of Theorem 11.61 

Theorem 2.1 For any GL2(Z) -invariant set S C Vz, let N(S; X) denote the number of 'GL2(Z)- equivalence 
classes of irreducible elements f E S satisfying H(f) < X . Then 

(a) N(V^;X) = -1_ C(2 )X 5 / 6 + 0(X 3 / 4+£ ); 

(b) N{V^-X) = ^c(2)x 5/6 + 0(x 3/4+£ ); 

(c) N{vi 2) -X) = A-((2)X 5 / 6 +0(X 3 /^). 

Our strategy to prove Theorem 12.11 is as follows. In §2.1[ we develop the necessary reduction 
theory needed to establish convenient fundamental domains for the action of GL2(Z) on Vr. The primary 
difficulty in counting points in these fundamental domains is that they are not compact, but instead have 
a rather complicated cuspidal region going off to infinity. To deal with and effectively handle this cusp, in 
£12.21 we investigate the distribution of reducible and irreducible points inside these fundamental domains. 
Specifically, we prove that the cusp contains only reducible points, while the remainder of the domain outside 
the cuspidal region contains primarily irreducible points. In £12.31 we develop a refinement of an averaging 
method introduced in [5], [6] to count points in these fundamental regions in terms of the volumes of these 
domains. The volumes of the fundamental regions are then computed in £12.41 completing the proof of 
Theorem O 

Finally, in £12.51 we prove a stronger version of Theorem l2. II where we restrict to counting those binary 
quartic forms whose coefficients satisfy finitely many congruence conditions. We will require this stronger 
result in an essential way when we prove Theorems 1 1 . 1 ( IT731 and 11.91 in Sections 5, 5, and 3, respectively. 



2.1 Reduction theory 

(i) i 

For i = 0, 1, and 2, let V^ denote the set of points in Vr having exactly 4 — 2i real roots in P . Then it 
is easy to see that V^ is the set of definite forms in Vr, i.e., forms f(x,y) that take only positive or only 
negative values when evaluated at nonzero vectors (xo,yo) € M 2 . Let V^ 2+ ' (resp. V^ 2 ') denote the subset 

( 2^ 

of consisting of the positive definite forms (resp. negative definite forms). Note that for i = 0, 1, and 2 

we have V Z W = n V z . We analogously define V Z W = n V z for i = 2+ and 2-. 
We have the following facts (see 20 [ Remark 2]): 

1. The set of binary quartic forms in Vr having fixed invariants I and J consists of just one GL2(M)-orbit 
if 4/ 3 — J 2 < 0; this orbit lies in V^ 1} . 

2. The set of binary quartic forms in Vr having fixed invariants / and J consists of three GL2(R)-orbits 
if 4/ 3 — J 2 > 0; in that case, there is one such orbit from each of Vg , V^ 2+ \ and V^ 2- "*. 

We construct fundamental sets iJy for the action of GL2(M) on for each i G {0, 1, 2+, 2—}. Let 
us first consider the case where i is 0, 2+, or 2—. In that case, if a form / £ vS 1 ^ has invariants / and J, then 
4/ 3 — J 2 > 0; furthermore, / is then the unique form in Vg up to GL2 (M)-equivalence having invariants / 
and J. Similarly, if / G has invariants / and J, then 4/ 3 — J 2 < and / is then the unique form in Vr, 
up to GL2(K)-equivalence, having invariants / and J. 

Since I(g ■ f) = (det g) 4 I(f) and J(g ■ f) = (det 3 ) 6 J(/), it is clear that two forms /i,/ 2 G 
are GL2(K)-equivalent if and only if there exists a positive constant A G K with I(fi) = \ 2 I(f2) and 
J(/i) = A 3 J(/2). In particular, given a pair (I, J) 7^ (0,0), there always exists a positive constant A such 
that H(\ 2 I,\ 3 J) — 1. It follows that, for i = 0, 2+, or 2— (resp. for % = 1), a fundamental set Ly^ for 
the action of GL 2 (R) on can be constructed by choosing one form / G Vj^ for each (/, J) such that 
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4 0) - p y -ixy 3 -± 2/ 4 :-2<t<2} 

l { v = {^v - |^y 3 + §^ 4 ■ -i < * < 1} u + - ^y 4 ■. -2 < t < 2} 

^ 2+, = {^ 4 -^^ + ^v +S ' : -2<*<2} 

Table 1: Explicit constructions of fundamental sets Ly for GL2(R)\V^ I ' ) 
H(I, J) = 1 and 4/ 3 — J 2 > (resp. 4/ 3 — J 2 < 0). TableQ]provides explicit constructions of such fundamental 

(i) U) 

sets Ly. The key fact that we use about these chosen fundamental sets L v is that the coefficients of all 
binary quartic forms in these Ly are uniformly bounded; i.e., the Ly all lie in a compact subset of Vr. Note 
that for any fixed h £ GL2(M) (or, indeed, for any h lying in a fixed compact subset Go C GL2(K)), the 
set hLy 1 is also a fundamental set for the action of GL2 (K) on Vg® , and again all coefficients are uniformly 
bounded. 

We will have need for the following lemma, whose proof is postponed to §3.81 

(i) 

Lemma 2.2 Let f be an element in having nonzero discriminant. Then the order of the stabilizer of f 
in GL 2 (K) is 8 if i = or 2, and 4 if i = 1. 

Let T denote Gauss's usual fundamental domain for GL2(Z)\GL2(R) in GL2(K). Then T may be 
expressed in the form T = {nakX : n(u) £ N'(t), a(t) £ A' , k £ K, A £ A}, where 

N'(a) = ^l i y.u£„(a)Y A' = ^^ \ : t > , A = j ( X x ) : A > o} , (6) 

and K is as usual the (compact) real orthogonal group S02(R); here v(u) is a union of one or two subintervals 
of [— |, i] depending only on the value of a £ A' . 

For i = 0, 1, 2+, and 2—, let 2 • m denote the cardinality of the stabilizer in GL2(M) of an irreducible 
element v £ V^. Then, by Lemma [2.21 we have uq = 4, Tlx = 2, 7J2+ = 4, and na- = 4. Thus for any 
h £ GL2 (M) , we see that IFhLy 1 is essentially the union of rij fundamental domains for the action of GL2 (Z) 
on ; here, we regard as a multiset, where the multiplicity of a point x in is given by the 

cardinality of the set {g £ T : x £ ghL^y}. Thus a GL 2 (Z)-equivalence class x in V^' is represented in 
this multiset m(x) times, where m{x) — #StabGL 2 (R)( ;E )/ : !^StabGL2(z)( ;z: )- Since the stabilizer in GL2(Z) of 
an element x £ V% always contains the identity and its negative, m(x) is always a number between 1 and rij. 

We conclude that, for any h £ GL 2 (M), the product m ■ N(V^;X) is exactly equal to the number 

(i) 

of irreducible integer points in ThL v having absolute discriminant less than X, with the slight caveat that 
the (relatively rare — see Lemma f2.4|) points with GL2(Z)-stabilizers of cardinality 2r (r > 1) are counted 
with weight 1/r. 

As mentioned earlier, the main obstacle to counting integer points of bounded discriminant in a 
single domain ThL v is that the relevant region is not compact, but rather has a cusp going off to infinity. 
We simplify the counting in this cuspidal region by "thickening" the cusp; more precisely, we compute 
the number of integer points of bounded discriminant in the region ThL^y by averaging over lots of such 

(i) 

fundamental regions, i.e., by averaging over the domains ThL v where h ranges over a certain compact 
subset Go £ GL2 (M) . This refinement of the method of [6] is described in more detail in ^2.31 

However, we first turn in £12.21 to estimating the number of reducible points in the main bodies (i.e., 
away from the cusps) of our fundamental regions. 
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2.2 Estimates on reducibility 

We consider the integral elements in the region 1Zx{hLy) := {w 6 FhLy : |-ff(u>)| < X} that are reducible 
over Q, where h is any element in a fixed compact subset Go of GL^fR). Note that if a binary quartic form 
ax 4 + bx 3 y + cx 2 y 2 + dxy 3 + ey 4 satisfies a = 0, then it is automatically reducible over Q, since y is a factor. 
The following lemma shows that for integral binary quartic forms in IZx (hLy ) , reducibility with a ^ does 
not occur very often: 



Lemma 2.3 Let h G Go be any element, where Go is any fixed compact subset o/GL^R). Then the number 
of integral binary quartic forms ax 4 + bx 3 y + cy 2 + dxy 3 + ey 4 € 7Zx(hL v ) that are reducible over Q urai/i 
a 7^ is 0(X 2 / 3+e ) 7 where the implied constant depends only on Go- 

Proof: Let f(x, y) = ax 4 + bx 3 y + cx 2 y 2 + dxy 3 + ey 4 be any element in 1Zx(hLy). Then since TZx(hLy) C 

N'A'KAhL$ (where < A < X 1 / 24 ), and /iL^ lies in a fixed compact set, we see that a = 0(X 1 / 6 ), 
b = OiX 1 / 6 ), c = 0(X 1 /6), ad = 0(X 2 ^ 6 ), bd = 0(X 2 / 6 ), and ae = 0(X 2 ^ 6 ). In particular, the latter 
estimates clearly imply that the number of points in TZx{hL v ') with a ^ and e = is 0(X 4 / 6+e ). 

Let us now assume that a ^ and e ^ 0. We first estimate the number of forms that have a rational 
linear factor. The above estimates show that the number of possibilities for the quadruple (a, b, d, e) is at 
most 0(X 4 ^ 6+e ). If px + qy is a linear factor of f(x, y), where p,a€Z are relatively prime, then p must be 
a factor of a, while q must be a factor of e; they are thus both determined up to 0(X e ) possibilities. Once 
p and q are determined, computing f{—q,p) and setting it equal to zero then uniquely determines c (if it is 
an integer at all) in terms of a, b, d, e,p, q. Thus the total number of forms / €E lZx(hL v ) having a rational 
linear factor and a ^ is 0(X 4//6+e ). 

We now estimate the number of binary quartic forms in IZx {hLy) that factor into two irreducible 
binary quadratic forms over Z, say 

ax 4 + bx 3 y + cx 2 y 2 + dxy 3 + ey 4 = (px 2 + qxy + ry 2 ) ( — x 2 + sxy H — y 2 J 

\p r ) 

where p, q, r, s € Z and p,q,r are pairwise relatively prime. Since ae = 0(X 2 / 6 ) and a,e ^ 0, the number 
of possibilities for the pair (a, e) is 0(X 2 / 6+e ). We then see that p divides a and r divides e, and hence the 
number of possibilities for (p, r), once a and e have been fixed, is bounded by 0(X e ). 
Next, equating coefficients, we see that: 

a 7 

-q + ps = b, 

P (7) 

e w 

— q + rs = d. 
r 

We split into two cases. We first consider the case where ^ 7^ f, i.e., the linear system ([7J in 
the variables q and s is nonsingular. Then the values of b and d uniquely determine q and s, and so the 
total number of quadruples (a,b,d,e) — and hence the total number of octuples (a,b,d,e,p,r,q,s)) — is at 
most 0(X 4 / 6+e ). Furthermore, once this octuple has been fixed, this also then determines c by equating 
coefficients of x 2 y 2 . Hence there are at most 0(X 4 ^ 6+€ ) possibilities for (a,b,c,d,e) in this case. 

Next, we consider the case where ^ = 2, so that the system ([7]) is singular. In this case, the value 

of b determines the value of d uniquely, namely d — (r/p)b. We have already seen that there are 0(X 2 ^ 6+e ) 
possibilities for the quadruple (a, e,p, r). Since there are only 0(X 1 / 6 ) choices for each of b and c, and then 
d is determined by b, the total number of choices for (a, b, c, d, e) is again 0(X^ 6+£ ), as desired. □ 

Finally, we have the following lemma which bounds the number of GL2 (Z)-equivalence classes of 
integral binary quartic forms having large stabilizers inside GL2(Z); we defer the proof to 



Lemma 2.4 Let h £ Go be any element, where Go is any fixed compact subset of GLa(K). Then the number 
of integral binary quartic forms ax 4 4 
has size greater than 2 is 0{X 3 / 4+e ). 



of integral binary quartic forms ax 4 + bx 3 y + cx 2 y 2 + dxy 3 + ey 4 £ lZx{hL^y) whose stabilizer in GL2(Z 



12 



2.3 Averaging and cutting off the cusp 

Let Go be a compact left .fT- invariant set in G that is the closure of a nonempty open set and in which every 
element has determinant greater than or equal to 1. Then for i = 0, 1, 2+, and 2—, we may write 

( z ' >~ — dh ' (8) 

' h J he Go 

(i) 

where V^ rr denotes the set of irreducible elements in VJ rr and L is equal to L v . The denominator of the 
latter expression is an absolute constant Cq greater than zero. 

More generally, for any GL 2 (Z)-invariant subset S C V% , let N(S;X) denote the number of irre- 
ducible GL2(Z)-orbits in S having height less than X. Let S" rr denote the subset of irreducible points of 5. 
Then N(S;X) can be similarly expressed as 



IheGn #i x e ThL n S ™ : H i x ) < X } dh 

iv yo, yv; = - 

Given x € let Xl denote the unique point in L that is GL 2 (K)-equivalent to x. We have 



N(S; X) = F) — . (9) 

C G 



N(S;X) = -^ V f #{g e T : x = ghx L }dh. (10) 

C g[ xes ^ AeGo 

H(x)<X 

For a given x £ S* lrr , there exist a finite number of elements gi,...,g n G GL 2 (IR) satisfying gjXL — %■ We 
then have 



/ #{.g E T : x = ghx L }dh = V] / #{# E J 7 : gh = gj}dh = V] / 



J 

As g?/i is an invariant measure on G, we have 



9j 



V/ dh = Y\l d 9 = J2 #{h € G : gh = gj}dg = #{h e G : x = ghx L }dg. 

j JheG nr-^g 3 ■ JgeGog-'nr- 1 ■ J g er J ger 

Therefore, 

iV(S;A) = -L I #{heG :x = ghx L }dg. (11) 



Go i£S 
H(a:)<X 



\- ( S irr n 5 G L : H(x) < X} dg (12) 

#{x € S irr nn( rl t )\kG Q L : H(x) < X}t~ 2 'dn d x t d x X dk . (13) 



C G Jg£N>(t)A>AK 

Let us write B(n,t,X,X) = n^ 1 t )XG Ln{x e V® : H{x) < X}. As KG = G and J K dk = 1, we have 
N(S;X) = -Lr [ #{xeS irl nB(n,t,X,X)}t- 2 dnd x td x X. (14) 

To estimate the number of lattice points in _B(n,i, A, A), we have the following proposition due to 
Davenport 22 . 
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Proposition 2.5 Let 1Z be a bounded, semi-algebraic multiset in R n having maximum multiplicity m, and 
that is defined by at most k polynomial inequalities each having degree at most t. Let TV denote the image 
of 1Z under any (upper or lower) triangular, unipotent transformation of W 1 . Then the number of integer 
lattice points (counted with multiplicity) contained in the region TV is 

Vol(ft) + 0(max{Vol(7l), 1}), 

where Yo\(lZ) denotes the greatest d-dimensional volume of any projection of 1Z onto a coordinate subspace 
obtained by equating n — d coordinates to zero, where d takes all values from 1 ton— 1. The implied constant 
in the second summand depends only on n, m, k, and I. 

Although Davenport states the above lemma only for compact semi-algebraic sets 7Z C M. n , his proof adapts 
without significant change to the more general case of a bounded semi-algebraic multiset 7Z C M n , with the 
same estimate applying also to any image TV of 7Z under a unipotent triangular transformation. 

(i) 

By our construction of the L v , the coefficients of the binary quartic forms in GqL are all uniformly 
bounded. Let C be a constant that bounds the absolute value of the leading coefficient a of all the forms 
ax 4 + bx 3 y + cx 2 y 2 + dxy 3 + ey in GqL. 

We then have the following lemma on the number of irreducible lattice points in B(n, t, A, X): 



Proposition 2.6 The number of lattice points (a, b, c, d, e) in B(n, t, A, X) with a ^ is 

( if ^ < 1; 

\ Vol(B(n, t, A, X)) + 0(max{C 4 i 4 A 16 , 1}) otherwise. 



Proof: If CX/t < 1, then a = is the only possibility for an integral binary quartic form ax 4 + bx 3 y + 
cx 2 y 2 + dxy 3 + ey 4 in B(n,t, A, X), and any such binary cubic form is reducible. If C\/t > 1, then A and 
t are positive numbers bounded from below by (v / 3/2)/C and respectively. In this case, one sees 

that the projection of B(n,t, X,X) onto a = has volume 0(C 4 i 4 A 16 ), while all other projections are also 
bounded by a constant times this. The lemma now follows from Proposition 12.51 □ 

In (fT4|) . since L (and therefore also GqL) only contains points with height at least 1, we observe (by 
the definition of B(n,t, X, X)) that the integrand will be nonzero only if t < CX and A < X 1 / 24 . Thus we 
may write, up to an error of 0(X 2 / 3+e ) due to the estimates on reducible forms in Lemma |2~51 that 

N(V^-X) = —- / / (Vo\(B(n,t,X,X)) + 0(m&x{C 4 t 4 X 16 ,l}))t~ 2 dnd x td x X. (15) 



(i) i — i — f 

Cq J\=V3/(2C) Jt=V3/2 JN'(t) 

The integral of the second summand is immediately evaluated to be 0(X 3 ^ 4 ). Meanwhile, the integral of 
the first summand is 

v l/24 

1 r ' rX 



Cq^ JheGo K J \=^Z/{2C) Jt=C\J N'{t) 

However, Vol(7^x(^)) is independent of h; also, since Vo\(B(n, t, A, X)) — 0(X 20 ), by carrying out the 
integration in the second term of (fTBl . we see that that this term is also 0(X 3 ^ 4 ). We conclude that 

N(yi l) ;X) = Yo\(n x {L))/m + 0(X 3 ' 4 ^). (17) 
To complete the proof of Theorem 12.11 it thus remains only to compute the volume Vo\(7Zx(L)). 



Vo\(Kx(hL))dh-0( _ Vol(B(n,t,X,X))t- 2 dnd x td x x). (16) 
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2.4 Computing the volume 

Let i be equal to 0, 1, 2+, or 2—. In this section, we compute the volume of 1Zx{Ly). To accomplish this, 
we define the usual subgroups N,A + , N, and A of GL 2 (M) as follows: 



1 u 
1 



-l 



t 



1 

u 1 



N = {n(u) : u £ R}, where n(u) = 

A = {a(t) : t £ M x }, where a(t) - 

N = {n(u) : u £ R}, where n{u) - 

A = {^^ where A > 0. 

It is well-known that the product map N x A x N — > SL 2 (R) maps bijectively onto a full measure set in 
SL 2 (M). This decomposition gives a Haar measure dg on SL 2 (R), namely, dg = t~ 2 dfid x tdn = t~ 2 dud x tdu. 

Let Ry := AL V \ Then for each (/, J) £ R x R, the set iJ^ contains at most one point pjj having 
invariants / and J. Let Ry\x) denote the set of all those points in Ry having height less than X. Then 

Yo^lZxiLy 1 )) = Vo1(J-sl 2 where J r sL 2 is the fundamental domain TV' A' K for the action of SL 2 (Z) 

on SL 2 (R) (here N', A', and K are as in ©). 

The set R 1 ^ is in canonical one-to-one correspondence with the set {(/, J) £ R x R : I 3 — J 2 /4 > 0} 
if i = 0, 2+, or 2-, and with {(/, J) £ R x R : I 3 — J 2 /4 < 0} if i = 1. There is thus a natural measure on 

(i) 

each of these sets R v , given by <ir = dl dJ. We have the following proposition: 
Proposition 2.7 For any measurable function (j) on Vr, we have 

^g-pfj)dgdIdJ=[ Mv)dv. (18) 
Airii Jrw Jsl 2 (m) ^y K w 

We know that SL 2 (M) • is an n^-fold cover of V^ l \ The proposition then follows from a Jacobian 
computation and can be verified directly. 

There is an alternative method of arriving at (|18l) that does not require one to use the explicit 
formulas for the fundamental sets L v . We prove the following proposition, which will also be of use to us 
in the sequel. 

Proposition 2.8 Let dg be the Haar measure on SL 2 (C) obtained from the usual NAN decomposition. Let 
dv be the standard Euclidean measure on Vc, the C-vector space of all binary quartic forms with complex 
coefficients. Let R be a subset ofVc that contains exactly one point pi t j having invariants equal to L and J 
for each pair (J, J) £ C x C and such that GoR is measurable for any measurable set Go C SL 2 (C). 
Then for any measurable function <f> on Vc, we have 



, , </% • pi.j)dgdLdJ = I 4>(v)dv 

ztn JR JSL 2 (C) JveVc 

where n is equal to 8, the size of the stabilizer of any point v G Vc having nonzero discriminant. 

Proof: We first prove that SL 2 (C) • R is an 8- fold cover of a full measure set in Vc- To do this, we consider 
the set i? consisting of the forms qij — x 3 y — -xy 3 — -^y 4 for each (/, J) £ C x C. Note that qi j has 
invariants equal to / and J. 

Now if pjj £ R has nonzero invariants / and J such that AI 3 — J 2 ^ 0, then there exist 8 elements 
gij £ SL 2 (C) such that gi t j -pi,j = qi.j. This can be seen in the following steps. We first apply an SL 2 (C)- 
transformation on pij to send one of its four distinct roots in P 1 (C) to the point [1 : 0]. The resulting 
quartic is of the form bx 3 y + cx 2 y 2 + dxy 3 + ey 4 with 6^0. Secondly, we use a diagonal matrix and then 
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a lower triangular matrix of SL2(C) to transform the quartic into the form x 3 y + d'xy 3 + e'y 4 . Because this 
quartic still has invariants equal to I and J, we see that it is in fact equal to qi,j. Finally, identically as in 
the proof of Lemma 1 2. 2 [ the stabilizer in SL2(C) of any binary quartic form in Vfc having invariants I =/= 
and J 7^ has size 8. Thus SL2(C) ■ R is an n-fold cover of a full measure set in Vc- 

The proposition can be checked directly for R = Rq via a Jacobian computation. We may now use 
this fact to derive the result for general R as follows: 

4>{v)dv = J- ( [ <p(g- q LJ )dgdr = J- / f <f>(ggi,j • Pi,j)dgdr 
veSh 2 (C)-R 11 J R a Jsl 2 (C) 11 JrJsl 2 (C) 

2 

27 



<j>(9 • Pi,j)dgdr, 

R JSL 2 (C) 



where the last equality follows from the fact that SL 2 (C) is a unimodular group (see [Mj Chapter 8]), i.e., 
the left Haar measure dg is also a right Haar measure on SLi2(C). □ 



Proposition 12 .81 along with the principle of permanence of identities, implies an analogous result 
when C is replaced with K or Q p for any prime p. In particular, it implies Proposition 12.71 which may now 

be used to compute the volume of the multiset TZx(Ly): 



f dv= [ dv^^-j { dg dl dJ = J- • C(2) / dldJ. 

Jnx(L<p) J^-R^ix) 27 yflW(x)^ SLa 27 JR<v } (x) 

When i = 0, 2+, or 2—, we compute J R V) (X \ dl dJ to be 



(19) 



X 1 / 3 2I 3 ^ 2 X ' 

/ / dIdJ= [ 4I 3 / 2 dI=-I 5 / 2 \f /3 = -X 5/6 - (20) 

Ji=o Jj=-nw* Ji=o 5 5 

Meanwhile, } R W^ X ) dl dJ is equal to 

{ r dldJ - Vol(< 0) (X j) = 8X 5 ' e - -X 5 / 6 = ^X 5 ' 6 . (21) 



We conclude that 



{16 
— ■ C(2)X 5 / 6 for i = 0, 2+, and 2-; 

fi4 ( 22 ) 
^•C(2)^ for^l. 

As no = ri2+ = Ti2— = 4 and ri\ = 2, equations (|17l) and (1221) now immediately imply Theorem 12. II 

To deduce Theorem 11.81 from Theorem 12.11 we require a count of the number of eligible pairs 
(7, J) G Z x Z satisfying H (I, J) < X. We use the following lemma which we prove in H3.8I 

Lemma 2.9 The set of eligible (J, J) € Z X Z is a union of 9 lattices, each of index 243 in Z x Z. 

The next proposition is now a simple application of Proposition 12.51 

Proposition 2.10 Let Nf j{X) and NT j(X) denote the number of eligible (I, J) G Z x Z satisfying 
H(I, J) < X that have positive discriminant and negative discriminant, respectively. Then we have 

(a) N+ J (X) = ^X 5 / 6 + 0(X 1 / 2 ); 

(b) N- J (X) = —X*/ e + 0(X 1 / 2 ). 
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Proof: Let denote the set € R 2 : |i| < X 1 / 3 , \j\ < 2X 1 / 2 , 4i 3 - j 2 > 0} and let denote the 

set £ K 2 : \i\ < X 1 / 3 , \j\ < 2X 1 / 2 , i 3 - j 2 < 0}. The sizes of the projections of onto smaller- 

dimensional coordinate hyperplanes are all bounded by 0{X 1 / 2 ). Using Proposition 12.51 and Lemma [231 we 
then see that Nfj{X) = ^Vol(R^) + 0(X X / 2 ). The volumes of R\ and were computed in and 
(|2ip. respectively, and the proposition follows. □ 

Theorem 11.81 now follows from Theorem 12.11 and Proposition 12.101 



2.5 Congruence conditions 

In this subsection, we prove a version of Theorem 12 . 1 1 where we count integral binary quartic forms satisfying 
any finite specified set of congruence conditions. 

Suppose S is a subset of Vz defined by finitely many congruence conditions. We may assume that 
S C Vz is defined by congruence conditions modulo some integer m. Then S may be viewed as the union of 
(say) k translates Ci, . . . , Ck of the lattice m- Vz- For each such lattice translate Cj, we may use formula (fl~4|) 



and the discussion following that formula to compute N(Cj H V%^;X), where each d-dimensional volume is 
scaled by a factor of l/m d to reflect the fact that our new lattice has been scaled by a factor of m. With these 
scalings, the maximum volume of the projections of B(n,t, A, X) is seen to be at most 0(t A 16 ). Analogous 
to Proposition 12.61 we see that the number of points (a, 6, c, d, e) in B(n, t, A, X) n Cj with a ^ is 



cx 

1 



if if < 1; 



-Vol(£(n, t, A, X)) + 0(max{C 4 i 4 A lb , 1}) otherwise. 

Carrying out the integral for N(Cj D V^;X) as in (JTTJ) , we obtain, up to an error of 0{X 2 l 3+t ) 
corresponding to the reducible points in Lemma 12.31 that 

N(C 3 n vf;X) = Vol ftM £ y)) + 0{x V^y 

Tli ■ 171 

Summing over j, we thus obtain 

N(S n ; X) = fcVol (^(4 } )) + (X 3 ^). (23) 
m ■ mP 

For any set S in Vz that is definable by congruence conditions, let us denote by /-i p (S) the p-adic 
density of the p-adic closure of S in Vz p , where we normalize the additive measure \i p on Vz p so that 
/i p (Vz p ) = 1. We then have the following theorem: 

Theorem 2.11 Suppose S is a subset ofVz defined by finitely many congruence conditions. Then we have 

N(SnV^;X) = JJ/ip(5) + 0(X 3 ^), (24) 

p 

where n P (S) denotes the p-adic density of S in Vz, and where the implied constant depends only on S. 
Theorem 12.111 follows from Equations (IT7|) and (|2"3"|) . together with the identity km~ 5 = Yipl^pi^)- 

3 The mean number of 2-torsion elements in the class groups of 
monogenic cubic fields 

In this section, we determine the mean size of the 2-torsion subgroup of the class group of monogenic cubic 
fields, where these cubic fields are ordered by the heights of their defining cubic polynomials. In particular, 
we prove Theorem 11.91 (assuming certain uniformity estimates which will be proven in Section 4) . 



17 



3.1 The parametrization of monogenic cubic rings 



We work over a principal ideal domain R that we will primarily assume to be equal to Z, Q, R, Z p , or Q p 
for some prime p. 

Let Ur denote the space of binary cubic forms over R, i.e., polynomials of the form g(X, Y) = 
qX 3 + rX 2 Y + sXY 3 + tY 3 with q, r,s,t€ R. We call qX 3 the leading term of g(X, Y), and q the leading 
coefficient. A monic binary cubic form is defined to be a binary cubic form whose leading coefficient is equal 
to 1, i.e., it is of the form X 3 + rX 2 Y + sXY 2 + tY 3 . We denote the set of monic binary cubic forms over 
R by Ur,i C Ur. 

Every monic binary cubic form g(X, Y) = X 3 + rX 2 Y + sXY 2 + tY 3 naturally gives rise to a cubic 
ring C = R[X]/(X 3 + rX 2 + sX + 1) over R which is monogenic, i.e., the ring C is generated over R by a 
single element X. The element X is then said to be a monogenizer of C . We define a monogenized cubic 
ring to be a pair (C, x) such that C is a monogenic cubic ring over R and a; is a monogenizer of C . 

Two monogenized cubic rings (C,x) and (C',x) are said to be isomorphic if there exists a ring 
isomorphism cj> ; C C such that (f>(x) — x' + u for some u £ R. In particular, the monogenized cubic rings 
(C, x) and (C, x + u) are isomorphic. Thus it is natural to consider the corresponding monic integral binary 
cubic forms g(X, Y) and g(X + u, Y) to be equivalent too, leading to a natural action of the subgroup 

of SL 2 (R) on Ur,!, given by 7 • g(X,Y) = g((X,Y) ■ 7). 

We may summarize the above discussion in the following proposition, which is essentially the same 
as Proposition 4.3.1 in [41] . 

Proposition 3.1 The FR^-orbits on Ur,% are in bijective correspondence with isomorphism classes of mono- 
genized cubic rings over R. 

Next, if g(X, Y) = X 3 + rX 2 Y + sXY 2 + tY 3 , then one easily sees that the quantities 

1(g) := r 2 -3 S , 

J(g) := -2r 3 + 9rs-27t [ ' 

are invariant under the action of Fr i. The discriminant A(g) of the binary cubic form g can be expressed 
in terms of these basic invariants 1(g) and J(g), namely, A(<?) = (4I 3 (g) — J 2 (g))/27. We again define the 
height of g by 

H(g) := H(I, J) = max{|/ 3 (g)|, J 2 (g)/4}. 

If C = R[X]/g(X), then the height H(C,X) of the monogenized cubic ring (C, X) is naturally defined by 
H(C,X):=H(g). 



3.2 Counting monogenized cubic rings by height 

We begin with the observation that the -F^i-orbits on Uz,i are specified uniquely by their invariants (/, J). 
Indeed, the set 

,9> = {X 3 + rX 2 Y + sXY 2 +tY 3 : r e {-1, 0, 1}; s, t e 1} (27) 

is clearly a fundamental set for the action of Fz,i on Uz,i- Given g(X, Y) — X 3 + rX 2 Y + sXY 2 + tY 3 e ,y 
with invariants I, J, the value of r e { — 1,0,1} is determined by the value of J (mod 3) since J = —2r 3 
(mod 3). Once r is so determined, s is determined by the value of I and then t is determined by the value 
of J. 

This raises the question as to which pairs (I, J) of invariants can actually arise for a monic binary 
cubic form. This question is answered by the following proposition: 

Proposition 3.2 The pair (I, J) occurs as the invariants of a monic binary cubic form if and only if one 
of the following conditions is satisfied: 
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(a) 1 = (mod 3) and J = (mod 27); 

(b) 1=1 (mod 9) and J = ±2 (mod 27); 

(c) 1=4 (mod 9) and J = ±16 (mod 27); 

(d) 7=7 (mod 9) and J = ±7 (mod 27). 

Proof: Recall that 5? ', as defined in (|2~T1) . is a fundamental set for the action of Fz,\ on Uz,i- Therefore, a 
pair (J, J) occurs as the invariants of some monogenic binary cubic form in Uz,i if and only if it occurs as 
the invariants of some element in ,y. 

Suppose that I, J give the invariants of g(X, Y) = X 3 + rX 2 Y + sXY 2 + tY 3 e i.e., I = r 2 - 3s 
and J = — 2r 3 + 9rs — 27t. If I = r 2 — 3s = (mod 3) then r = 0, implying that 27 | J. This is condition (a). 

If I is not divisible by 3, then r = 1 or — 1 and thus 1=1 (mod 3). Thus I must be congruent to 1, 4, 
or 7 (mod 9), which happens exactly when s is congruent to 0, 2, or 1 (mod 3), respectively. Because r 2 = 1, 
we see that J = r(9s — 2) (mod 27). It follows that I = 1,4,7 (mod 9) corresponds to J = ±2, ±16, ±7 
(mod 27), respectively, yielding conditions (b), (c), and (d). 

The converse also follows easily by reversing the above arguments. □ 

We now have the following lemma which bounds the number of reducible monic binary cubic forms, 
up to Fz.i -equivalence, having height less than X. 

Lemma 3.3 Let 5? be the fundamental set for the action of Fi.i on Uz,i as defined by \27$ . Then the 
number of forms g € ,y such that g is reducible and H(g) < X is 0{X 1 / 2+<i ). 

Proof: First, we note that if g(x, y) = x 3 + rx 2 y + sxy 2 + ty 3 6 satisfies H(g) < X, then since |/(g)| 3 = 
\r 2 - 3s| 3 < H(g) < X, we see that s = Opf 1 / 3 ). Then since J{g) 2 /A = (2r 3 + 9rs - 27<) 2 /4 < H(g) < X, 
this in turn implies that t = 0{X 1 / 2 ). 

Let us now count such forms g that are reducible. If g(x, y) = x 3 + rx 2 y + sxy 2 + ty 3 £ 5? satisfies 
t = 0, then g is reducible, and the number of forms j £ y with t = and H(g) < X is the number of 
possible values for r and s, namely 3 • 0(X 1 ^ 3 ) = 0(X 1 ^ 3 ). 

Next, we consider those reducible forms g(x, y) = x 3 + rx 2 y + sxy 2 + ty 3 e 5? satisfying H (g) < X 
and t 7^ 0. If x — my is a factor of g, then m | t. Therefore, if we fix t ^ 0, then there are at most t e = 0(X e ) 
choices for m. Moreover, once r, t, and m are fixed, then setting g(m, 1) equal to determines s. Since 
t = 0{X 1 / 2 ), and there are at most 3 possible values for r, it follows that there are at most 0(X 1 / 2+e ) such 
reducible forms g £ 5? with height less than X. This completes the proof. □ 

Let uj^\ (resp. U% \) denote the set of monic binary cubic forms over Z having positive (resp. 
negative) discriminant. For an Fzj-invariant set S C Uz,i, let N(S; X) denote the number of -F^i-equivalence 
classes of irreducible elements g € S satisfying H(g) < X. The following proposition gives asymptotic 
formulas for N(U^};X) and N{U^};X): 

Proposition 3.4 We have 

(a) N (ui%X) = ^X 5 / e + 0(X^); 

(b) N{U^;X) = ^X^ + 0{X^). 

Proof: A monic binary cubic form is determined up to equivalence by its invariants / and J. It therefore 
suffices to count integer pairs (I, J) such that: |/| < X 1 / 3 , \ J\ < 2X 1 ^ 2 , and (I, J) occurs as the invariants 
of some monic binary cubic form g. Proposition 13.21 now reduces the proof of Proposition 13.41 to counting 
integer points {(I, J) ■ H{I,J) < X} satisfying the congruences (a)-(d) of Proposition 13.21 The proof of 
Proposition 12.101 now yields the result. □ 

We saw in §3.1 that isomorphism classes of monogenized cubic rings are parametrized by i^i-orbits 
on Uz.i- Since irreducible forms in Uz,i correspond to monogenized orders in cubic fields, we have the 
following corollary: 
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Corollary 3.5 Let M3 (X) (resp. M3 (X)) denote the number of isomorphism classes of monogenized 
orders in cubic fields having height less than X and positive (resp. negative) discriminant. Then 

(a) M(°)(X) = —X 5 / 6 + 0(X 1 / 2+e ); 

135 

(b) M«(X) = ^^ 5/6 + 0(X 1 ' 2+t ). 

3.3 The parametrization of quartic rings having monogenic cubic resolvent rings 

In this subsection, we discuss the parametrization of quartic rings over a base ring, i.e., rings that are free 
of rank 4 as a module over the base ring. For simplicity, we choose our base ring R to be a principal ideal 
domain having characteristic not equal to 2, which we will again primarily assume to be Z, Q, R, Z p , or Q p 
for some p. 

Let Wr denote the space of pairs (^4, B) of ternary quadratic forms having coefficients in R. We 
write an element (A, B) 6 Wr as a pair of 3 x 3 symmetric matrices via 



2 • (A,B) 



2au a i2 ai 3 

ai2 2d22 «23 
0>13 a 23 2a 33 



2&n 612 613 

612 2fe 2 2 ^23 

613 ^23 2633 



where ay, fey € i?. The group Gr = GL2(i?) x SL3(i?) acts naturally on the space Vr. Namely, an element 
53 G SL 3 (i?) acts on Vr by 53 • (A, B) = (gzAg^gsBgQ, while an element g 2 = {% f) G GL 2 (i?) acts by 
52 ■ (^4, -B) = (pA + qB, rA + si?). We have the following theorem parametrizing quartic algebras over R: 

Theorem 3.6 ([4]) There is a canonical bisection between GR-orbits on Wr and isomorphism classes of 
pairs (Q,C), where Q is a quartic ring over R and C is a cubic resolvent ring of Q. 

A cubic resolvent ring of a quartic ring Q is a cubic ring G equipped with a certain quadratic resolvent 
mapping 4 , 3 : Q/R —> C/R; see [3J §2.3, Definition 8] for more details. If (A,B) € Wr, and (Q,C) is the 
corresponding pair of rings, then C can be explicitly described as follows. Let g(X, Y) = 4- det(AX — BY). 
Then g is a binary cubic form over _R, and C is the cubic ring corresponding to the form g under the 
Dclone-Faddeev bijection [26 . (A complete description of the construction of (Q, C) can be found in [4]). 

Since G R — GL 2 (i?) x SL 3 (R) acts on Wr, the groups GL 2 (R) C G R and SL 3 (i?) c G R also act on 
Wr by restriction. Note that the action of SL 3 (i?) fixes the binary cubic form g(X, Y) = 4 • det(AX — BY), 
for any point (A, B) in Wr. 

Now let Vr denote the space of binary quartic forms with coefficients in R. We consider the following 
twisted action of GL 2 (i?) on Vr. Let 7 £ GL 2 (i?) and / € Vr. Then 



7 • f(x,y) = 



1 



det(7) ; 



:f((x,y) ■ 7). 



(28) 



Notice that for R = Z, the action (|28|) is the same as the non-twisted action but for R = R it is different. 
From now on, whenever we refer to an action of GL2(i?) on Vr we shall be talking about the action (|28p. 
Note that the center of GL2(i?) acts trivially on Vr under this action. Therefore the action descends to one 
of PGL2(i?) on Vr, and it is clear that the quantities / and J remain invariant under this action. 
The space of integral binary quartic forms embeds naturally into Wr via the map 



ax + bx 3 y + cx 2 y 2 + dxy 3 + ey A M> 



1/2 



-1 



1/2 



a 6/2 
6/2 c d/2 
d/2 e 



(29) 



We denote the first matrix in the above pair by A\ 



Let f(x,y) = 



- bx y + cx y + dxy + ey € Vr be a binary quartic form and let us write (f>(f) as 



(Ai,Bf). Let (Q, C) be the pair of rings associated to (A\,Bf). As we remarked earlier, C is the cubic ring 
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associated to the binary cubic form g(X, Y) = 4 • det(AiX — BfY), and g is monic because det(j4i) = 1/4. 
In fact, we compute g to be: 



g(X, Y) = X 3 + cX 2 Y + (bd - 4ae)XY 2 + (ad 2 + b 2 e - Aace)Y 3 . (30) 

Moreover, we find that /(/) = 1(g) and «/(/) = J(g)- We define the resolvent binary cubic form of / to be g. 

Note that the group F R ^ C GL 2 (i?) C Gr acts on the subset Wr,\ :— {(A, B) : A = A ± } inside Wr, 
and every Fr^i -equivalence class of Wr,\ contains a unique clement (A\,B) such that the top right entry of 
B is equal to 0. Therefore the space of binary quartic forms Vr maps to the set of Fr i -equivalence classes 
on Wri via the composite map 

Vr -► Wr A ->■ Fr A \Wr A . 

From the representation of PGL 2 (i?) on binary quadratic forms px 2 — 2qxy + ry 2 , we obtain a map 
p : PGL 2 (i?) SL 3 (i?), given explicitly by 



a 



b\ I ( d " cd c " \ (31) 

i y — ; — — | 2bd ad + be 2ac 

b 2 ab a 2 



c d I ad - be 



The image of PGL2(i?) is contained in the orthogonal group SO(Ai,R) because A\ is the Gram matrix of 
the ternary form q 2 — pr. In fact, the image of PGL 2 (i?) is equal to SO(Ai, R); the proof parallels that of 
Lemma 4.4.2 in [41] . 

It is easily checked that the map cf> is equivariant with respect to p; i.e., (f>(j ■ f) — p(^) ■ 4>(f) for 
7 G PGL 2 (-R) and / g Vr. Therefore, we have the following theorem: 

Theorem 3.7 The map <j) defined by <\29\ gives a canonical bijection between PGL 2 (R)-classes on Vr and 
Fr,i x SO(Ai, R)-classes on Wr,\. 

We now restrict ourselves to the case where R = Z. It is known that A\ is the Gram matrix of the 
unique quadratic form over Z up to SL3(Z)-equivalence having determinant equal to 1/4. Therefore, every 
pair (A, B) £ Vz such that det(A) — 1/4 is Gz-equivalent to (Ai,B') for some integral ternary quadratic 
form B' . We thus have the following corollary to Theorem 13 .71 which is Theorem 4.1.1 in [41] . 

Corollary 3.8 (Wood |41j) There is a canonical bijection between GL 2 (Z) -equivalence classes of integral 
binary quartic forms and isomorphism classes of pairs (Q,C), where Q is a quartic ring over Z and C is a 
monogenized cubic resolvent ring of Q. 

In i j3.8t we will also have occasion to use Theorem 13.71 when R = M. 

Combining Theorem 2.1 and Corollary 13.81 we obtain the following theorem. 



Theorem 3.9 Let (X) denote the number of isomorphism classes of pairs (Q,C) where Q is an order 
in a quartic field having 4 — 2i real embeddings and C is a monogenized cubic resolvent ring of Q having 
height less than X . Then 

(a) Mf\x) = ^ E C(2)X 5 ^ + 0(X 3 /^)- 

(b) M«(X) = ^(2)X 5/6 + 0(X 3 / 4+£ ); 

(a) Mi 2) (X) = 4((2)^ 5/6 + 0(X 3 / 4+e ). 
135 
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3.4 The parametrization of 2-torsion elements in the class groups of monogenic 
cubic fields 

A cubic ring C over Z is said to be maximal if it is not strictly contained in any other cubic ring over Z. If 
a cubic ring C is maximal, denote by 02(C) and ClJ(C) the 2-torsion subgroup of the class group and of 
the narrow class group of C, respectively. Denote the abelian groups dual to 02(C) and Cl^ (C) by 02(C)* 
and 02(C)*, respectively. 

If we restrict Corollary 13.81 to maximal orders, then we obtain the following theorem: 

Theorem 3.10 Let g be an irreducible monic integral binary cubic form having invariants I and J such 
that the cubic ring C(g) := Z[x]/g(x) is maximal. 

(a) Suppose 4/ 3 — J 2 > 0. Then there is a canonical bisection between elements o/ClJ(C)* and GL2(Z)- 
equivalence classes of integral binary quartic forms having invariants I and J. 

Under this bijection, elements of 02(C)* C O^(C)* correspond to those binary quartic forms having 
4 real roots. 

(b) Suppose 4/ 3 — J 2 < 0. Then there is a canonical bijection between elements of 02(C)* = ClJ(C)* and 
GL2(Z)- equivalence classes of integral binary quartic forms having invariants I and J . 

Moreover, the set of binary quartic forms having an integer linear factor and invariants equal to I and J all lie 
in the same GL2(Z)- equivalence class; this equivalence class corresponds to the identity element o/ClJ(C)*. 

Proof: If Q is a maximal quartic order, then we say that Q is overramified at the prime p G Z if p factors 
into primes in Q (g> Z p as P , P 2 , or P 2 P 2 2 ; similarly, Q is overramified at the the archimedean prime of Z if 
Q ® R = C 2 . A maximal quartic order Q (or the quartic field A4 in which it lies) is nowhere overramified if 
Q is not overramified at any prime of Z (including the archimedean prime). 

The significance of being nowhere overramified is as follows. Given an 54-quartic field A4, let A24 
denote its Galois closure. Let A3 denote a cubic field contained in K24 (the "cubic resolvent field of K4"), 
and let K§ be the unique quadratic extension of A3 such that the Galois closure of Kq over Q is A24. 
Then it is a theorem of Heilbronn |31j that the quadratic extension K§ / A3 is unramified precisely when the 
quartic field K4 is nowhere overramified. Conversely, if A3 is a noncyclic cubic field, and Kq is an unramified 
quadratic extension of A3, then the Galois closure of Kq is an ^-extension A24 that contains a unique (up 
to conjugacy) nowhere overramified quartic extension K4. 

Thus isomorphism classes of nowhere overramified quartic fields K4 are in canonical bijection with 
isomorphism classes of pairs (Kq, A3) up to isomorphism, where A3 is a cubic field and K§ is an unramified 
quadratic extension. By class field theory, we then see that isomorphism classes of nowhere overramified 
quartic fields K4 are in canonical correspondence with pairs (K^^H), where A3 is a cubic field and H is 
an index 2 subgroup of the class group C1(A 3 ) of A 3 . Of course, index 2 subgroups H of C1(A 3 ) are in 
canonical correspondence with nontrivial characters £ : 0(^3) — > {±1} having kernel H, and so they are 
thus in correspondence with nontrivial elements of the abelian group dual to 02(^3), the 2-torsion subgroup 
of the class group 0(^3) of A" 3 . 

We conclude that isomorphism classes of nowhere overramified quartic fields K4 are in bijective 
correspondence with pairs (K^^g), where A3 is a cubic field and g is a nontrivial element of 02^3)*. An 
analogous argument shows that isomorphism classes of quartic fields K4 that are not overramified at any 
finite prime are in bijective correspondence with pairs (A3, g), where A3 is a cubic field and g is a nontrivial 
element of ClJ(A3)*. Restricting the above correspondences to quartic fields having monogenic resolvent 
cubic fields, in conjunction with Corollarv l3.8l now yields parts (a) and (b) of the theorem. 

To see the final assertion of the theorem, we recall that if C is a maximal cubic order, then there is 
a unique quartic ring Q (up to isomorphism) that is not an integral domain but has C as its cubic resolvent 
ring, namely, Z © C. If C is both maximal and monogenic, then by Corollary 13.81 Q will correspond to 
the unique GL2(Z)-equivalence class of binary quartic forms corresponding to the pair (Z © C, C), and any 
binary quartic form in this equivalence class will have a linear factor over Z. □ 
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Theorem l3.10l implies that if I and J are the invariants of a maximal monogenic cubic ring C, then 
the set of all GL2 (Z)-equivalence classes of integral binary quartic forms having invariants equal to (/, J) 
naturally forms a group, namely, the elementary abelian 2-group Cl^(C)*. In particular, the number of 
GL2 (Z)-equivalence classes of integral binary quartic forms having such invariants (J, J) is always a power 
of two! We will show in Sections 3.5, 3.6, and 3.7 (in Lemma 13.151 Proposition 13.171 and the proof of 
Theorem II. 91 respectively) that "most" — indeed a density of 1/C(2) ~ 60.8% of — pairs (J, J), when ordered 
by height, have this property. 



3.5 Computations of p-adic densities 

A cubic ring C over Z is maximal if and only if for all primes p it is maximal at p, i.e., C ® Z p is the maximal 
cubic ring over Z p contained in C®Q p . Similarly, a quartic ring Q is maximal (i.e., Q is the maximal quartic 
ring in Q <g> Q) if and only if for all primes p it is maximal at p, i.e., Q (g> Z p is the maximal quartic ring 
over Z p contained in Q ® Q p . We say that a pair (Q,C), where Q is a quartic ring over Z and C is a cubic 
resolvent ring of Q, is strongly maximal (resp. strongly maximal at p) if both Q and C are maximal (resp. 
maximal at p). 

In order to determine the average number of 2-torsion elements in the class groups of monogenized 
cubic fields, when these fields are ordered by their heights, we wish to count: 1) the number of Fz i-orbits 
on the space Uz,i of integral monic binary cubic forms that correspond to maximal cubic orders C having 
bounded height under the bijection of Proposition 13. 11 and 2) the number of GL 2 (Z)-orbits on the space Vz 
of integral binary quartic forms that correspond to strongly maximal pairs (Q, C) of bounded height under 
the bijection of Corollary 13.81 To do this, we need to understand, and determine the density of, the set of 
congruence conditions on Uz,i and Vz that define maximality on the corresponding cubic and quartic rings. 



Monic binary cubic forms 

We first consider the sets Uz,i, Uz p ,i, and J7f 1 of monic binary cubic forms over the integers Z, the p-adic 
ring Z p , and the residue field F p , respectively. Any form g S Uz,i (resp. Uz 1, J7f p ,i) determines exactly 
three points in F p , obtained by taking the roots in F p of the reduction of g modulo p. We define the symbol 
{9>P) by setting 

where the /j's indicate the degrees of the fields of definition over ¥ p of the roots of g, and the Cj's indicate 
the respective multiplicities of these roots. There are thus five possible values for the symbol (g,p), namely, 
(111), (12), (3), (1 2 1), and (l 3 ). Furthermore, it is clear that if two monic binary cubic forms g\ , g2 over Z 
(resp. Zp, Fp) are equivalent under a transformation in F z ,i (resp. Fz p ,i, Fp 1), then (gi,p) — (52, £>)■ 

If C(g) is the ring corresponding to an integral monic binary cubic form g, then the symbol (g,p) 
conveys precisely the splitting behavior of C(g) at p. For example, if (g,p) = (111) for g S Uz,i, then this 
means that p splits completely in the cubic ring C(g). 

Let T p {Ux, (111)), T p (Ui, (12)), etc., denote the set of g € U z ,\ such that (g,p) = (111), (g,p) = (12), 
etc. Let fi — fip be the Haar measure on Uj, P; i normalized to satisfy fi(Uz Pl i) — 1. For any set S in Uz.i 
that is definable by congruence conditions, let fx(S) denote the p-adic density of the p-adic closure of S in 
Uw. p ,\- We then determine the densities of the sets T p (Ui, ■ ) in the following lemma. 

Lemma 3.11 We have 

KT p (U l7 (111))) = i(p-l) ( P -2)/p 2 , 
H[T P {U U (12))) = !(p-l)/p, 



H(T P (U U (3))) = \{p 2 -l)/p 
MW, (1 2 1))) = ( P -l)/p 2 



2 



M (Tp(EMl 3 ))) = 1/p 2 



Proof: Since the criteria for membership of g in T p (U±, •) depends only on the residue class of g modulo p, 
it suffices to consider the situation over F p . We examine first p,(T p (Ui, (111))). The number of unordered 
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distinct triples in F p is ip(p — l)(p — 2). Furthermore, given such a triple, there is a unique element in Uw p ,i 
having this triple as its set of roots. As the total number of forms in U^ p ,i is equal to p 3 , it follows that 
li(T p {U\, (HI))) = \{p -1)0- 2)/p 2 as given by the lemma. 

Similarly, the number of unordered triples of distinct points, one member of which is in F p while 
the other two are F p -conjugate in F p 2, is given by \p(p 2 — p). Also, the number of unordered F p -conjugate 
triples of distinct points in F p3 is {p 3 -p) /3. We thus have (i(T p (Ui, (12))) = \ (p- l)/p and n(T p (Ut, (3))) = 
§(p 2 -l)/p 2 - 

Finally, the number of forms in J7f p ,i having a double root at a point of ¥ p and a single root at a 
different point of F p is p(p — 1), and the number of forms in Uw i having a triple root in ¥ p is p. Therefore 
H(T P (U 1} (1 2 1))) = (p - 1)/ P 2 and »(T p (U u (l 3 ))) = 1/p 2 . □ 

We now state a lemma proved in [9l Section 3]. 

Lemma 3.12 The cubic ring C(g) corrresponding to an integral binary cubic form g is nonmaximal at p if 
and only if any one of the two following conditions are satisfied: (1) g is a multiple of p; or, (2) there exists 
an integer r such that r (mod p) is a multiple root of g (modp), and furthermore g(r) is a multiple ofp 2 . 

Let U p (Ui) denote the subset of elements g € Uz p ,i that are maximal at p, i.e., whose associated 
cubic rings C(g) are maximal at p. Similarly, let U P (U\, ■ ) denote the subset of elements g € T p (Ui, ■ ) 
such that C(g) is maximal at p. In the following lemma, we determine the densities of the sets U p (Ui, ■ ) in 

|(P-1) (p-2)/V, 
l(p-l)/p, 
I (P 2 ~ 1)/P 2 , 
(p-l) 2 /p 3 , 
(p-l)/p 3 . 

Proof: If g is an element of T p (Ui, (111)), T p (Ui, (12)), or T p (Ui, (3)), then the cubic ring corresponding 
to g is clearly maximal at p, as its discriminant is coprime to p. Thus U p (Ui, (111)) = T p (U±, (111)), 
Z4,(I7i,(12)) = T p (l7i,(12)), and U P (U U (3)) = T P (U U (3)). 

Now, if g is in T p (Ui, (1 2 1)) or T p (Ui, (l 3 )), then it can be brought into the form g(x,y) = x 3 + 
rx 2 y + sxy 2 + ty 3 with s = t = (mod p), namely, by sending the unique multiple root of g (mod p) to 
(mod p), via a transformation in Fz,t- Of all g in T p (Ui, (1 2 1)) or T p (Ui, (l 3 )) that have been rendered in 
such a form, a proportion of 1/p actually satisfy the congruence t = (mod p 2 ). Thus, by Lemma 13.121 a 
proportion of (p — l)/p of forms in T P (U\, (1 2 1)) and in T p (Ui, (l 3 )) correspond to cubic rings maximal at p. 
The lemma now follows from Lemma 13.111 □ 

Binary quart ic forms 

We next consider the sets Vz, Vz , and Vf of binary quartic forms over the integers Z, the p-adic ring Z p , 
and the residue field F p , respectively. Each form f G Vz (resp. Vz , Vw ) determines exactly four points in 
Pi , obtained by taking the roots of / reduced modulo p. Again, for such a form /, define the symbol (f,p) 
by setting 

(/,p) = (/r/! 2 ---) ) 

where the /j's indicate the degrees of the fields of definition over F p of the roots of /, and the e^'s indicate 
the respective multiplicities of these roots. There are then eleven possibilities for the value of the symbol 
(f,p), namely, (1111), (112), (13), (22), (4), (1 2 11), (1 2 2), (1 2 1 2 ), (2 2 ), (1 3 1), and (l 4 ). Furthermore, it is clear 
that if two binary quartic forms ft, fa over 1 (resp. Z p , F p ) are equivalent under a transformation in GL2(Z) 
(resp. GL 2 (Z P ), GL 2 (F P )), then {f u p) = (f 2 ,p). 



Lemma 3.13 We have 

»{U P {U X , (111))) = 

»(U P (U U (12))) - 

M (W p (C/i,(3))) - 

/i(W p (£Ml 2 l))) = 

^(^.(l 3 ))) - 
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If Q(f) is the quartic ring corresponding to an integral binary quartic form /, then the symbol (/,p) 
conveys precisely the splitting behavior of Q(f) at p. For example, if (f,p) — (1111) for / £ V%, then this 
means that p splits completely in the quartic ring Q(f). 

By T P (V, (1111)), T P (V, (112)), etc., let us denote the set of / £ V z such that (f,p) = (1111), 
(f,p) = (112), etc. Now, for any set S in Vz that is definable by congruence conditions, let us denote by 
n(S) = [i p {S) the p-adic density of the p-adic closure of S in Vz , where we normalize the additive measure 
(i on Vz p so that n(Vz p ) = 1. Let V P (V) denote the set of elements f £ Vz that are strongly maximal at 
p, i.e., whose associated cubic ring C is maximal at p (the associated quartic ring Q is then automatically 
maximal at p). We see from [5J Section 3.1] that the resolvent cubic C of a quartic ring Q can fail to be 
maximal at p in two ways: (1) Q is nonmaximal at p, or (2) Q is maximal but overramified at p. Therefore 
f £ VpOO ^ ano - om y if the corresponding quartic ring is either nonmaximal at p or it has splitting type 
(1 2 1 2 ),(2 2 ), or (l 4 ) at p. 

Let V P (V, ■ ) denote the subset of strongly maximal elements / £ T p (V, ■ ). In the next lemma, we 
determine the densities of the sets V P (V, •): 

Lemma 3.14 We have 



M (v p (v,(mi))) 


= ±(p + i)( P - 


I) 2 (p-2)/p- 




= k(p+ 


l)(p- 


l) 2 /p 3 , 


M (V P (V,(13))) 


= lb 2 - 


- l) 2 /p 4 , 


M(V P (V,(22))) 


= i(p- 


l) 2 b 2 - 


-p-2)/p 4 , 


M(V P (V, (4))) 


= J(p+ 


1) (p- 


i)7p 3 , 


M (V P (V,(1 2 11))) 


= |(p- 


l) 3 (p+ 


1)/P 5 , 


M (V P (V,(1 2 2))) 


= i(p- 


l) 3 (p+ 


1)/P 5 , 


MW(1 3 1))) 


= (p- 


1) 2 (P4 


-1)/P 5 , 


M (V P (T/,(1 2 1 2 ))) 


= o, 






M(W(2 2 ))) 


= o, 






M(v P (y,(i 4 ))) 


= 0. 







Proof: If / is an element of T P (V, (1111)), T p (F, (112)), T p (V, (13)), T p (V, (22)), or T p (V, (4)), then the 
discriminant of the resolvent binary cubic form of / (which is also equal to the discriminant of /) is coprimc 
to p. Thus / is strongly maximal at p, and so V P (V, (1111)) = T P (V, (1111)), V P (V, (112)) = T P (V, (112)), 
V P (V, (13)) = T P (V, (13)), V P (V, (22)) = T P (V, (22)), and V P (V, (4)) = T„(V, (4)). 

As before, membership of / in T P (V, ■ ) depends only on the reduction of / modulo p. Therefore it 
suffices to consider the situation over Vw p - To compute fj,(T p (y, (1111))), note that there are ^(p + l)p(p — 
l)(p — 2) quadruples of unordered distinct points in P 1 (F p ). Four distinct points in P 1 (F p ) determine the 
reduction of / £ T p (V, (1111)) modulo p, up to scaling by elements in F p , by specifying its roots in P 1 (F p ). 
As there are p 5 forms in Vw , we obtain fi(T p (V, (1111))) = ±(p + l)(p - l) 2 (p - 2)/p 4 . The arguments for 
computing the p-adic densities of T p (V, (112)), T p (V, (13)), T P (V, (22)), and T p (V, (4)) are similar. 

If / is in T P (V, (1 2 1 2 )), T p (V^, (2 2 )), or T p (V, (l 4 )), then / is overramified and is not strongly maximal. 
So fi(V p (V, (1 2 1 2 ))), t*(V p (V, (2 2 ))), and /i(V p (V, (l 4 ))) are each equal to 0. 

Suppose that f(x, y) = ax 4 + bx 3 y + cx 2 y 2 + dxy 3 + ey 4 belongs to the closure in Vz of T p (V, (1 2 11)), 
T p (V, (1 2 2)), or T p (V, (1 3 1)). Then the reduction of / modulo p has a unique multiple root in P^Fp). By 
a suitable GL2(Z p ) transformation, we may assume that this multiple root is equal to [0, 1], i.e., d = e = 
(mod p). Since / is not overramified, the reduction modulo p of the binary quadratic form ax 2 + bxy + cy 2 
does not have a double root, i.e., b 2 — 4ac ^ (mod p). The reduction modulo p 2 of the resolvent binary cubic 
form of / is equal to x 3 +cx 2 y+(bd— Aae)xy 2 +e(b 2 — Aac)y 3 , and therefore its reduction modulo p has a double 
root at [0 : 1] G Pi . Therefore / is strongly maximal if and only if e(b 2 — Aac) ^ (mod p 2 ), i.e., e ^ 
(mod p 2 ). This happens with probability 1 - 1/p. Therefore p(V p (V r , (1 3 1))) = (1 - l/p)n{T p (V, (1 3 1))), 
/i(V p (V, (1 2 H))) = (1 - 1/pHT p (V,(1 2 11))), and fi(V p (V, (1 2 2))) = (1 - l/p) M (T p (V, (1 2 2))). A simple 
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computation, similar to the computation of /j,(T p (V, (1111))), then shows that these densities are as stated 
in the lemma. □ 

Summary of p-adic densities for maximality in Uz,i and Vz 

Using the fact that U P {U\) is the disjoint union of the U P (U\, ■ )'s and that V P (V) is the disjoint union of 
the V P (V, ■ )'s, we arrive at the following lemma. 

Lemma 3.15 We have 

^(UpiUx)) = ( P 2 -l)/p 2 , 
M (V P (F)) = (p 2 -lf/ P 4 . 

3.6 Congruence conditions and uniformity estimates 

We now wish to count the number of Fz t i -equivalence classes of monic integral binary cubic forms of bounded 
height whose coefficients satisfy a specified finite set of congruence conditions. 
Similar to Theorem 12. Ill we have: 

(i) 

Theorem 3.16 Suppose S is an F% t i-invariant subset ofU% [ defined by finitely many congruence conditions. 
Then we have 

N(S; X) = N(U®;X) ]J » P (S) + 0{X^). (32) 
v 

However, the set of forms in Uz,i corresponding to maximal cubic rings is defined by infinitely many 
congruence conditions, namely, by congruence conditions modulo p 2 for every prime p. Similarly, the set 
of forms in Vz corresponding to strongly maximal pairs of rings is defined by infinitely many congruence 
conditions, again by congruence conditions modulo p 2 for every prime p. To prove that versions of (1241) and 
(I3"2l still hold for such sets, we require a uniform estimate on the error terms in and (f3"2"j) as the number 
of congruence conditions on S increases. 

Let W p (Ui) denote the set of elements in Uz i that are not maximal at p. Let W P (V) denote the 
set of elements in Vz that are not strongly maximal at p. Then we require the following estimates. 

Proposition 3.17 N(W p (Ui); X) = 0(X 5 / 6 /p 3 / 2 ), where the implied constant is independent of p. 

Proof: In [2j Lemma 2] it is shown that the number of isomorphism classes of cubic rings that are not 
maximal at p and have discriminant less than X is bounded by 0(X/p 2 ), where the implied constant is 
independent of p. Now the height of a monogenized cubic ring is at least half of the absolute value of its 
discriminant; therefore, using the fact that a cubic ring can have at most 12 monogenizations (see [25] and 
[57]), we obtain that i\r(W p (?7i);Z) = 0(X/p 2 ). 

Next recall that the set 5? = {X 3 + rX 2 Y + sXY 2 + tY 3 : r G {-1,0, 1}; s,t G Z}, defined in 
(|27p. is a fundamental domain for the action of Fz,i on Uz,i- Hence N(W p (Ui); X) is equal to the number 
of forms in 5? having height less than X that correspond to cubic rings that are not maximal at p. Now 
Lemma 13.121 implies that for fixed integer values of r and s, there exist at most two values of t (mod p 2 ) 
such that the cubic ring corresponding to the form f(X, Y) = X 3 + rX 2 Y + sXY 2 + tY 3 is not maximal at 
p. Furthermore, by the proof of Lemma l3~3l if / G 5? and H(f) < X, then s = 0(X 1 / 3 ) and t = 0(X 1 / 2 ). 

Therefore, we obtain the estimate 

N(W p (Ux);X) = 0(X 1/3 ) ■ 0(max{2, X 1/2 /p 2 }) = 0{X 1/3 + X 5/6 /p 2 ), 
which coupled with the earlier estimate N(W P (U\)\ X) — 0(X/p 2 ) yields the result. □ 
We similarly have 

Proposition 3.18 N(W P (V); X) = 0{X 5 / 6 /p 5 ^ 3 ), where the implied constant is independent of p. 

The proof of Proposition 13.181 is significantly more difficult than that of Proposition 13.171 Its proof will be 
one of the primary goals of Section |U 
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3.7 Proof of the main theorem (Theorem 11.91) 

In this subsection, we compute the mean number of 2-torsion elements in the class groups of monogenizcd 
cubic fields when these monogenized cubic fields are ordered by height. We thus prove Theorem ll.9l 

Let 14(11^) and U(U^) denote the subset of elements in Uz,i coppesponding to maximal cubic 
rings that have positive and negative discriminant, respectively. 

Proposition 3.19 We have 

a) hm "<Wft*> - 8 • 



x^oo X 5 / 6 135C(2) ' 

b) lim *m (1) );*) = _3g_ 
°> xZo 135C(2) 

Proof: We only prove part (a) of the proposition as the proof of part (b) is identical. Let U p (u[ ) be the 

set of elements in U%\ that are maximal at p. Then by Proposition 13.41 Lemma \3. 151 and Theorem 13. 161 it 
follows that for any fixed positive integer Y, we have 

lim ^<^p(f ) )^) = J_ n(1 - p - a) . 
x^oo X 5 / 6 135 11 v y ' 

p<Y 



Letting Y tend to infinity, we obtain that 

N{U(U[ 0) );X 



*S2 W 1 = lis nc - ^ - u^ry o> 



To obtain a lower bound for N(U(U^ ); X), we note that 

n^(t/i o) )c (u(u[ o) )u u w P {ui) 

p<Y p>Y 

Hence, by the uniformity estimate of Proposition 13 . 1 7l we obtain 

iV(^));X) >A 
x^oo X 5 / 6 - 135 11 > 

p<Y p>Y 

Letting Y tend to infinity completes the proof. □ 

Identically, by Proposition 12.11 Lemma 13.151 Theorem 12.111 an d Proposition 13. 18) if we denote the 
set of elements in corresponding to strongly maximal pairs of rings {Q, C) by V(V^), then we have 

lim W ( ;>);*) = 402) n _ 2 2 = 4 
x^oo X 5 / 6 135 111 y ' 135((2) V ; 



for i = 0, 2+, and 2—, and 



^(V^ 1 ));^) 32C(2) u _ 2 2 32 

— xm — = ^^W-p ) =i35C(2y (35) 
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By Theorem 13. 101 we see that 



E (#Ch(K)-l) = N(V(V^);X), 



H(K)<X 
Disc(if)>0 



E (#Ch(K)-l) = N(V(VW);X), 



H(K)<X 
Disc(_fC)<0 



E (#C\+(K) - 1) = NQ>(yW);X) + N(V(V^);X) + N{V(V^);X), 

H(K)<X 
Disc(if)>0 

where K = (K, x) ranges over monogenized cubic fields. We therefore have 

E 

x^oo 1 8/(135C(2)) 

H(K)<X 
Disc(A")>0 



lim ^ 1 + 32/(135 C (2)) 

x^oo Y] 1 32/(135C(2)) 

H(K)<X 
Disc(_R")<0 



E # C1 ^) 



lim = 1+ 12/(135C(2)) = 

x^oo V" ! 8/(135C(2)) 

ff(if)<X 
Disc(_fC)>0 

This proves the weaker version of Theorem 11.91 where we average over all monogenized cubic fields without 
any specified local splitting conditions. 

To complete the proof of Theorem 11.91 also in the case where finitely many splitting conditions are 
imposed, we note that if a single splitting type a is imposed on the cubic fields we are considering at a given 
prime p, then the Euler factor at p in (|33|) changes from jj, p (U p (Ui)) = (1 — p~ 2 ) to [i p (U p (Ui,a)); meanwhile, 
the Euler factor at p in (jMf changes from /j, p (V p (V)) = (1 — p~ 2 ) 2 to 

E »p(V p (V,6)), (36) 

where R is the natural map from quartic splitting types to cubic splitting types, namely: given a strongly 
maximal pair of rings {Q, C), the splitting type 9 of Q at p determines the splitting type R{6) of C at p. 
Thus to complete the proof of Theorem 11.91 it suffices to prove that the equality 



fi p (u p (Ui) MW)) 

of p-adic density ratios holds for all values of p and a. 

To prove (|3"T)) . we first determine R(0) in terms of 9 using the following well-known fact (see, e.g., 
[3"5] for a proof). 
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a 


Hp{Up(Ux) 


V / 


v V p (V,0) 


(111) 
(12) 


p-2 


{(1111), (22)} 
{(H2),(4)} 


P- 


2 p-2 p- 


2 


Rfm _L 1 1 

0(p + 1) 

P 


24(p- 
P 


fl) 8(p+l) 6(p4 


-1) 


2(v + 1) 


4(p^ 


-1) 4(p + l) 2(p + 


1) 


(3) 


1 

3 


{(13)} 




1 

3 




(1 2 1) 


p-1 


{(1 2 11),(1 2 2)} 


p - 


1 p-1 p - 


- 1 


P{P + 1) 


2p(p4 


-1) 2p(p+l) p(p- 


f 1) 


(I 3 ) 


1 


{(1 3 1)} 




1 




P(P + 1) 




pip + 1) 





Table 2: Demonstration of the equality of p-adic density ratios for U± and V 



Lemma 3.20 For any field F , there is a natural bijection between isomorphism classes of etale F-algebras 
of degree n and isomorphism classes, under conjugation within S n , of continuous homomorphisms ip : 
Gal(-F) —¥ S n , where Gal(-F) denotes the absolute Galois group of F. 

Now consider the canonical surjection A : S4 — > S3 whose kernel consists of the identity and the 
double transpositions in S4. If L is the degree four etale algebra over Q p corresponding to the map ip : 
Gal(Qp) — > 54, then one shows that the cubic resolvent K of L corresponds to the map Xoip ■ Gal(<Q> p ) — > S3 
(see 0]). 

Thus, for example, if the splitting type of Q at p is (22), then the image of the map ip : Gal(Q p ) — > ^4 
corresponding to Q ® Q p is a cyclic group C2 of order 2, where the nontrivial element of C2 is a double 
transposition. Hence the image of the composite map A o ip : Gq —> S3 is trivial and the splitting type of p 
in C is (111). Therefore i?((22)) = (111). The computations of R{9) for other splitting types 9 are similar. 

Now suppose p is a fixed prime and a = (111). Then the set R^ 1 (a) is equal to {(1111), (22)}. By 
Lemmas 13. 131 and I3.15[ we see that 

H P (Up(Ui,a)) _ p-2 
P P (W p ((7i) ~6(p+l)' 

while from Lemmas 13.141 and 13.151 we see that 

MUflefl(«r)Vp(y,fl)) = P-2 , P-2 = p-2 
H P (Vp(V)) 24(p+l) 8(p+l) 6(p+l)' 

The computations for other values of a are identical and are summarized in Table [5J This completes the 
proof of Theorem 11.91 

3.8 Proofs of auxiliary lemmas 

In this subsection, we complete the proofs of various auxiliary lemmas whose proofs were deferred. We begin 
by determining the order of the stabilizer in GL2(M) of binary quartic forms in Vr, thus proving Lemma l2.2l 

Proof of Lemma 12. 2t We start by computing the size of the stabilizer in PGL 2 (R) of binary quartic forms 
in Vr having nonzero discriminant, where PGL 2 (R) acts on Vr via the "twisted action" defined in (|31l) . 

By restricting the results of W3.3\ to the case R = M, we obtain an injective map p : PGL2(R) — >■ 
SL3(R), and a map <f> : Vg. — > Wi which is equivariant with respect to p. Let / be an element in Vr having 
nonzero discriminant and let 4>{f) = {A\,Bf). It is clear that 7 £ PGL2(R) stabilizes / £ Vr if and only 
if p(i) stabilizes (A u B f ), and ^(7) stabilizes (A u B f ) if and only if (id,p( 7 )) £ Gr = GL 2 (R) x SL 3 (R) 
stabilizes {A\,Bf). 
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An element (A, B) £ Wr that is nondegenerate (i.e., the discriminant of the cubic resolvent form of 
(A, B) is nonzero) determines four points Pi, P%, P3, and P4 in P 2 (C); these are the four intersection points 
in P 2 (C) of the two conies defined by A and B. It follows from [3] that the stabilizer of (A,B) in Gr can 
be canonically realized as a subgroup of the permutation group S4 on the set of points {Pi, P2, P3, P4}. 
Furthermore, an element 7 £ Gr stabilizing (Ax, Bf) has an inverse image in PGL2(R) under p if and only 
if 7 is contained in the kernel of the canonical surjection A : S4 — > S3 so that 7 is either the identity or a 
double transposition. 

If the splitting type of / at 00 is equal to (112) (resp. (1111), (22)), then the stabilizer of (A\,Bf) in 
PGL 2 (R) is a subgroup G2 x G2 (resp. S4, D4) of S4 which contains 1 (resp. 3, 3) double transpositions. Now 
if 7 £ GL 2 (M) stabilizes / £ Vr under the action defined in Equation then since I(j ■ f) — (detj) 4 I(f) 
and J (7 • /) = (det7) 6 J(/), the determinant det7 £ 1 of 7 is equal to ±1. Hence the image of 7 in 
PGL2(K) also stabilizes /. Since there are two elements in the center of GL2(M) that stabilize /, the size of 
the stabilizer in GL2(R) of an element v £ is 4 for i — 1 and 8 for i = or 2, as desired. □ 

Our next goal is to prove Lemma 12.41 which bounds the number of GL2(Z)-equivalence classes of 
integral binary quartic forms having bounded height and nontrivial stabilizer in GL2(Z). We first require: 

Lemma 3.21 The number of -equivalence classes of binary quartic forms f such that H(f) < X, 

and the binary cubic resolvent form of f is reducible, is bounded by 0(X 3 ^ 4+e ) . 

Proof: Let g be an element of Uz,i that has discriminant n and is reducible over Z, and let G be the 
corresponding cubic ring. Since G is a cubic ring lying in a cubic Q-algebra of the form Q © F, where F is a 
quadratic Q-algebra, by Proof of Lemma 12] the number of quartic rings, up to isomorphism, having G 
as a cubic resolvent ring is bounded by 0(n 1/4+£ ). 

Now the height of a cubic ring having discriminant n is at least n/2. Moreover, we see from 
Lemma 13.31 that the number of monogenized cubic rings having height less than X that correspond to 
reducible integral monic binary cubic forms is bounded by 0(X 1 ^ 2+e ). We conclude from Corollary 13.81 that 
the number of GL2(Z)-equivalence classes of binary quartic forms / such that H(f) < X, and the cubic 
resolvent form of / is reducible, is bounded by 0(X 1 / 2+e • X 1 / 4+e ) = 0(X 3 / 4+e ). □ 

Proof of Lemma l2.4t Suppose an integral binary quartic form / has a stabilizer of size greater than or equal 
to 2 in PGL2(Z), and let (Q, G) be the corresponding pair of rings. Then Q has a nontrivial automorphism 
over Z and therefore cannot be an order in an 54- or an ^-quartic field. If Q is an order in a -D4-, G4-, or 
T/4-quartic field, or is a quartic ring lying in a direct sum of two quadratic Q-algebras, then the cubic resolvent 
ring G of Q cannot be an integral domain; the desired bound in this case then follows from Lemma 13.211 

Thus we may assume Q is a quartic ring in an etale quartic Q-algebra of the form F = Q © K, 
where if is a cubic field, and Disc(Q) = k 2 Disc(K) for some k £ Z>o- As Q has nontrivial automorphisms, 
K must be a G3-cubic field. Thus Q has a cyclic group of order 3 as its automorphism group, as does G. 
The number of possibilites for G with discriminant at most X is then 0{X 4 / 2 ) by [101 Theorem 1]. Once G 
has been fixed, K is determined; furthermore, it follows from a result of Nakagawa |37j that the number of 
orders of index k in the ring of integers in the etale Q-algebra F = Q© K is at most 0(fc 1 / 2+e ) = 0(X 1 / 4+e ), 
where the implied constant is independent of F. We conclude that the total number of possibilities for the 
pair (G, G) having discriminant at most X is 0{X X / 2 ■ X x / i+€ ) — 0(X 3 / 4+e ) in this case. The same estimate 
thus also holds for height as in the proof of Lemma I3.21[ yielding the desired conclusion. □ 

Finally, we determine when a pair of invariants (/, J) £ Z x Z is eligible thus proving Lemma 12.91 

Proof of Lemma 12.91 If an integral binary quartic form has invariants equal to I and J, then its cubic 
resolvent form also has invariants equal to I and J. Therefore, for an integer pair (/, J) to be eligible, it 
must satisfy one of the congruences listed in Lemma I3~2"l Conversely, suppose an integer pair (I, J) satisfies 
one of the congruences of Lemma 13.21 By Lemma 13.21 there exists an integral monic binary cubic form 
g(X, Y) = X 3 + rX 2 Y + sXY 2 + tY A having invariants equal to I and J. One then checks that the cubic 
resolvent form of the binary quartic form f(x,y) — x 3 y + rx 2 y 2 + sxy 3 + ty A is equal to g, and so / has 
invariants equal to I and J. Therefore the pair (I, J) is eligible. □ 
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4 The mean number of 2-torsion elements in the class groups of 
submonogenized cubic fields 

In the previous section we determined the mean number of 2-torsion elements in the class groups of monog- 
enized cubic fields when these monogenized cubic fields are ordered by height. We now consider "submono- 
genized cubic fields" . We define a submonogenized cubic field of index n (or, an n-monogenized cubic field) 
to be a cubic field F together with an element x in the ring of integers k of K such that the index of Z[x] 
in @k is equal to n. If g denotes the characteristic polynomial of xx : k — ► 0k, then we define the height 
of the n-monogenized field (K, x) by 

H(K 1 x):=n 2 H(I(g) 1 J(g)). (38) 

We say that two n-monogenized cubic rings (C,x) and (C ,x') are isomorphic if C is isomorphic to C and 
there exists u € Z such that under this isomorphism x is mapped to x' + u. 

In this section, we determine the mean number of 2-torsion elements in the class groups of submono- 
genized cubic fields having height bounded by X and index bounded by X s , thus proving Theorem 11.101 
Surprisingly, these means are different from those obtained in Theorem 11.91 for monogenized cubic fields; as 
expected, they instead coincide with the averages for the number of 2-torsion elements in the class groups 
of cubic fields when they are ordered by discriminant (see [5]). Theorems 11.91 and 11.101 demonstrate that, on 
average, the monogenicity of a ring has a clear altering effect on the class group. 

This section is organized as follows. In §4.1, we determine the number of isomorphism classes of 
submonogenized cubic rings C having bounded height and index. In §4.2, we similarly determine the number 
of isomorphism classes of pairs (Q, C), where Q is a quartic ring and C is a submonogenized cubic resolvent 
ring of Q having bounded height and index. The layout of each of these subsections is largely parallel to 
that of Section 2. In §4.3, §4.4, and §4.5 we then prove, as in Sections §3.5 and §3.6, the various results 
needed to perform a sieve (analogous to the one in §3.7) to maximal orders. The proof of Theorem 11.101 is 
then completed in §4.7. 

4.1 The number of submonogenized cubic rings having bounded height and 
bounded index 

In this subsection, we count the number of (isomorphism classes of) submonogenized cubic rings having 
height bounded by X and index bounded by X s , where 6 < 1/4 is a positive constant. More precisely, we 
prove the following theorem: 

Theorem 4.1 Let (X,6) ( resp. N% (X,S)) denote the number of submonogenized rings (G,x) of index 
n with positive (resp. negative) discriminant such that H(C,x) < X and n < X s . Then we have 

(a) N^ 0) (X,8) = ±X 5 / 6 + 25 / 3 + 0(X 5 ^); 

(b) N^(X,S) = ^X 5 / 6 + 2S / 3 + 0(X 5 / 6 ). 

4.1.1 The parametrization of submonogenized cubic rings of index n 

For an integer n > 0, let Uz, n denote the space of all integral binary cubic forms whose x 3 coefficient is equal 
to n. To an element g(x, y) — nx 3 + bx 2 y + cxy 2 + dy 3 € Uz, n , we associate the n-monogenized cubic ring 
(Z[X,Y]/(X 3 + bX 2 + ncX + n 2 d, X-nY),X). As the n-monogenized cubic rings (C,X) and (C,X + u) are 
isomorphic, it is also natural to consider the corresponding integral binary cubic forms g(x, y) and g(x+u, y) 
in Uz,n to be isomorphic for all u G Z. We thus consider the action of the group Fz,i on Uz, n , defined as 
before by 7 • g(x, y) = g((x,y) -7). We then have the following proposition generalizing Proposition 13. II 

Proposition 4.2 The Fz^i-orbits on Uz,n are in bijective correspondence with isomorphism classes of n- 
monogenized cubic rings over 7L. 
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Let Um. denote the set of all binary cubic forms with coefficients in R. Let £/r,+ C C/r consist of 
those elements g(x, y) = ax 3, + bx 2 y + cxy 2 + dy 3 in C/r such that a > 0. As before, the group 



acts naturally on C/r (and J7k,+) under this action. 

Given a binary cubic form g(x,y) — ax 3 + foe 2 ?/ + cxy 2 + dy 3 in £/r.+ , it is -pR-equivalent to the 

monic binary cubic form g'(x, y) = x 3 H — tr^D + a l ^ 3 cxy 2 + ady 3 . Furthermore, if 7 <E -Fr 1, then the x 3 
coefficient of 7 • g remains equal to a. It is thus natural to define the quantities 

1(g) := 

J(g) := -2b 3 /a + 9bc-27ad, ( 39 ) 
0(5) := a, 

which are then invariant under the action of Fj& t \ C Fr on L/r (compare with (|2"o) in i j3.ll) . As usual, 
the discriminant of g can be written in terms of 1(g) and J(g) via the same formula, namely, A(<?) = 
(47(g) 3 - J(g) 2 )/27. We again define the fre^fa iJ(g) of g G U R)+ by 

fr(ff) := ff(J, J) = max{|/( 3 )| 3 , J( ff ) 2 /4}. (40) 

If (C, x) is an n-monogenized cubic ring and g is a corresponding integral binary cubic form, then 
we define the invariants I, J, and a of (C,x) by I(C,x) := 1(g), J(C,x) := J(g), and a(C,x) := a(g) = n. 
In particular, comparing with (|3"5j) . we see that H(C, a;) = ff(<?). 

Let J7 R °+ (resp. C/^ 1 ^) denote the subset of elements in Um.,+ having positive (resp. negative) discrim- 
inant, and let U%, (resp. UrpA denote the subset of integral elements. In the remainder of this subsection, 
we prove the following result, which combined with Proposition 14. 21 immediately yields Theorem 14. II 

Proposition 4.3 For an F^^-invariant set S, let N(S;X,6) denote the number of equivalence classes 
of irreducible elements g G S satisfying H(g) < X and < a(g) < X s . Then, for any positive constant 
5 < 1/4, we have 

(a) N(U ( ^;X,5) = ± . X 5 / e + 25 / 3 + 0(X 5 / 6 ); 

(b) N(U^;X,5) = ^.x^ 6 + 25 / 3 + 0(X 5 / 6 ). 



4.1.2 Reduction theory 

We first develop the reduction theory necessary to construct a fundamental domain for the action of Fz,i 
on £%,+• Our methods are quite similar to those in ^2.11 we start by describing a fundamental set for the 
action of Fr on J7r,+. 

Let Iq, Jq £ 1 x R be such that 4/q — Jq ^ 0. Then there exists exactly one -FR-equivalence class in 
Ur,+ with invariants Iq and Jo. The analogous result for SL2(C)-equivalence classes in Vc was proven in the 
proof of Proposition I2.8[ an d a very similar argument applies to the case of -FR-equivalence classs in [%,+ . 

As with the action of SL2(C) on Vc, a fundamental set Ly (resp. L^) for the action of Fr on 
Ur\_ (resp. U^ + ) may be constructed by choosing one form g in (resp. E^r 1 ^) for each (I, J) such that 
H(I, J) = 1 and 4/ 3 — J 2 > (resp. 4/ 3 — J 2 < 0). Explicit constructions of such sets Lff are provided 
in Table |3l As in §2.1, the key fact about these Ljj that we require is that the absolute values of all the 
coefficients of all the forms in our Ljj are uniformly bounded. 

We also need the following lemma whose proof is elementary: 
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4 0) = {x*- l r y 2 -^:-2<t<2} 

L ( u ] = p - TfV 2 - ^V 3 : -1 < 5 < l} U {x 3 + ^xy 2 - ^y 3 : -2 < t < 2} 
Tabic 3: Explicit constructions of fundamental sets Lff for F K \/7^ 

Lemma 4.4 For any g G , tfee stabilizer of g in Fr is trivial. 

Let T\ denote the fundamental domain for F Z ^\F R given by the elements n(u)a(t)X with n G N', 
a G A + , and A G A, where 



N ' = {{1 1 ):-l/2<«<!/2}, ^ + = {(* _1 t ) : * >0 }' ^{( A a) :A >°} 



(41) 



Then Ji-L^ is a fundamental domain for the action of Fz,i on J7^ + . 



4.1.3 Estimates on reducibility 

Let TZx(Ljj-,S) consist of the elements g in the fundamental domain TiLy* satisfying H(g) < X and 
a(g) < X s . That is, define 

H X (L$;S) := jf>L« n { W : |fT(«;)| < X} 

where J 7 }^ C T\ consists of those elements naX in T\ satisfying X/X^ < t < A. We prove the following 
lemma. 

Lemma 4.5 Let S < 1/4 be a positive constant. Then the number of integral binary cubic forms in 
that are reducible over Z is 0(X 3 / 4+e ). 

Proof: If g = na(t)X G and t; G L% ] such that .gv G 1Z X (L$;5), then this implies A = Opf 1 / 12 ). Since 
(5 < 1/4, we have the estimates abc = 0(X 9 /t 3 ) = 0(X 3 ^), abd = 0(X 9 /t) = 0{X 3 / 4 ), ab = 0(X 6 /t 4 ) = 
0(X^ 2 ), ac = 0(X 6 /t 2 ) = 0(X 1 / 2 ), and ad = 0(A 6 ) = 0(XV 2 ). 

If ax 3 + bx 2 y + cxy 2 + dy 3 is a reducible binary cubic form, then it has a linear factor. The estimates 
on abc, ab, and ac show that the number of forms in TZx(L$;5) with d — is at most 0{X 3 / i ). If d ^ 0, 
then the estimates on abd and ad show that there are at most 0(X 3 / 4 ) possibilities for the triple (a,b,d) 
in lZx(L^; S). Once the coefficients a,b,d have been fixed, there are then only 0(X e ) possibilities for an 
integral linear factor px + qy of the binary cubic form, because p must divide a and q must divide d. Since 
a,b,d,p, and q then determine c, we conclude that the number of reducible integral binary cubic forms in 
n x {I$;5) is at most 0(X 3 ^+ £ ), as desired. □ 

4.1.4 Averaging and cutting off the cusp 

Let Fq be a compact set in Fr that is the closure of some open nonempty set, and assume further that every 

(i) 

element in F has determinant greater than 1. Let S C U% ' + be an Fz,i -invariant set and let S 1TI denote the 
subset of irreducible forms inside S. Then we have 

J #{.* G n FhL® : H(x) < X, < a(x) < X*}dh 

N{s '' x ' 5) = ■ 
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Just as in §2.3, we see that if Cf = J^p dh, then 
N(S] X,6) = -^- [ #{x G S' 1 " n gF L$ : H(x) < X, < a(x) < X s } dg (42) 

= -'- / #\x G rnn(' _1 JAFqL^ : < X, < o(as) < X 5 } t~ 2 dn d x t d x A. (43) 

Define S(n,t, A, X,5) := n(* _1 JAFqZ^ n {a; G L^ } + : < X, < a(ac) < X 5 }. Then 

N(S ilT ;X,6) = -^- [ #{x £ S hr r\B(n,t,\,X,S)}t- 2 dnd x td x \. (44) 

By our construction of the L v and i<o, all the absolute values of the coefficients of the forms in FqL\j' 
are uniformly bounded. Let C% and Ci be constants that bound the absolute values of the x 3 coefficients of 
all the forms in FqLjj from above and below, respectively. Let C3 be a constant such that I/C3 2 bounds 



le the forms in FgLfj from above 
number of irreducible lattice points in £?(n, t, A, X, S). 



(i) 

the heights of all the the forms in FqL)j from above. Then we have the following lemma estimating the 



Proposition 4.6 The number of irreducible integral binary cubic forms g in B(n,t,X,X,5) that satisfy 
< a(g) < X s is 

if CiA < 1 or 2^ > X s / 3 or C 3 A < 1; 

Vo\(B{n, t, A, X, 6)) + 0(max{A 9 i 3 , A 9 i~ 3 , 1}) otherwise. 

Proof: If CiX/t < 1 or C\^X/t > X 5 ^ 3 , then no integral binary cubic form in B(n,t,X,X,S) satisfies 
< a < X s . If A < C3 2 , then the heights of all the forms in B(n, t, A, X, S) are less than 1 and B(n, t, A, X, S) 
contains no irreducible integral forms. Otherwise, one sees that the projections of B(n, t, A, X, 5) onto a = 
and d = have volume 0(A 9 i 3 ) and 0(X 9 t~ 3 ), respectively, and all other projections are also bounded by 
0(A 9 i 3 + A 9 t~ 3 ). The result now follows from Proposition [231 □ 



Thus, up to an error of X 3 / 4+e due to the estimates on reducible forms in Lemma l4.5[ we have 

=c 3 Jc 2 \p 



1 rX 1 ^ 12 fC\X r 

N(U^;X,8) = — / / {Yol{B(n,t,X,X,S)) +0(mayi{t 3 X 9 ,t- 3 X 9 ,l}))t~ 2 dnd x td x X. 

L-F Jx=C 3 Jc 2 \/X s / 3 JN' 



The integral of the second summand is 0(X 5 ^ 6 ), while the integral of the first summand is \o\(Tlx(Ljj ] 5)) 
up to an error of 0(1). Thus, to complete the proof of Proposition 14.31 it remains only to compute the 
volume of TZx (Ljjj ; S) . 

4.1.5 Computing the volume 

The product map N x A + x A — > i*R is a bijection and this decomposition gives a Haar measure dg on Fr, 

namely, dg = t~ 2 dnd x td x X — t~ 2 dud x td x X. Write Lj^ as a union — Lj^' 1 \JL^' J , where L^' 1 contains 

the points where J = ±1, and L^' J the points where / = ±1. These subsets intersect in only finitely many 

points. We use the natural measures dl and dJ on L^' 1 and L < jj' J , respectively, and the usual Euclidean 
measure dv on Us.. Then we have the following proposition. 

Proposition 4.7 Let f be any measurable function on Us- Then we have 

(iotgff(g-f,)dgdI= I ,,./(»)*; 



g&F K j v£FwL ^-' 
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(b) I / _ / (det 5 ) 10 /(& -./■./ )</;/'/■/ = / ./('-Wr. 
A simple Jacobian computation proves Proposition 14.7 

-(<: 

) Tin nr-of^T-^ k,r DvA^nfU;^ IT 

rX 1 ' 12 ,A 



We now compute the volume of T^x(L^;S). If 7 € F K such that | det(7)| = A 2 , then H(j ■ v) = A 1 
for each v <E Ljj . Therefore, by Proposition 14.71 we see that 



Vol(n x (L i ° ) ;5)) = - [ f f x 12 t- 2 d x td*\dJ=-X 5 / e + 25 / 3 --X 5 / e . (46) 

9Jj=-2J\=q Jt=\/xe/z 45 45 

Similarly, the volume of lZx{L^; 5) is equal to 

A f 2 rX 1/12 pX a pi pX 1/12 f\ 

- / / \ 12 t- 2 d x td x XdJ + 2- - / / X 12 t- 2 d x td x \dl (47) 

9 Jj=-2 J\=o Jt=\/x s / 3 3 Ji=-i J\=o Jt=XIX s ' 3 



yielding 



This concludes the proof of Proposition ^. 3 



Vol(^(4 1) ;<5)) = ±x 5 ^ + 2 ^ 3 + ±x 5 ^ + 2S / 3 - -X 5 ' 6 = ^x^ 6 + 2S / 3 - -X 5 / 6 . (48) 
v v u " 45 15 45 45 45 



4.2 The number of quartic rings having submonogenized cubic resolvent rings 
of bounded height and bounded index 

Let us now consider pairs (Q, (C, x)), where Q is a quartic ring over Z and (C, x) is an submonogenized 
cubic resolvent ring of Q of index n. We define the height of (Q, (C, x)) by -ff (Q, (C, a;)) := H(C, x) and the 
index of (Q, (C, x)) by a(Q, (C, x)) := n. In this subsection, we count the number of isomorphism classes of 
such pairs (Q, (C, x)), where (C, x) has bounded height and index. More precisely, we prove the following 
theorem: 

Theorem 4.8 Let 5 < 1/4 be a positive constant. Let N^'(X,8) denote the number of isomorphism classes 
of pairs (Q, (C, x)), where Q is an order in an S4- or A^-quartic field having 4 — 2i real embeddings and 
(C, x) is a submonogenized cubic resolvent ring of Q having height less than X and index less than X s . Then 

(a) Ni Q) (X,S) = lx 5 / 6 + 25 /-Vo(X 5/6 + 25/3 ); 

(b) NP{X,S) = A X 5 / 6 + 25 / 3 + (A 5 / 6 + 25 / 3 ); 

(c) Ni 2) (X,S) = Lx*/* + 3e / 3 +o(X*/* + 36 / 3 ). 

4.2.1 The parametrization of quartic rings having submonogenized cubic resolvent rings of 
index n 

As in §3.2, we use W% to denote the space of pairs of integral ternary quadratic forms. Recall that the 
GL2(Z) x SL/3(Z)-orbits on Wz are in bijective correspondence with isomorphism classes of pairs (Q,C), 
where Q is a quartic ring over Z and C is a cubic resolvent ring of Q. 

Let Wz, n denote the set of all (A, B) e W z such that 4 • det(A) = n, and let Hz,i = F%,\ x SL 3 (Z). 
We have the following proposition which follows from |4j and Proposition ^. 21 

Proposition 4.9 The H%^-orbits on Wz, n are in canonical bijection with pairs (Q,(C,x)), where Q is a 
quartic ring over Z and (C, x) is a submonogenized cubic resolvent of Q of index n. 
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As in [5], we say that a pair (A, B) £ W% is absolutely irreducible if A and B do not possess a common 
rational zero (as conies in P 2 ) and the cubic resolvent form g of (A, B) is irreducible over Q. Equivalently, 
absolutely irreducible elements in W% correspond to triples (Q, (C,x)), where Q is an order in an S^- or 
Ai-quartic field. 

Now consider the space Wr of pairs of ternary quadratic forms over R. The groups Hr = FrX SL3 (R) 
and -£Tr.i = -Fr,i x SL3(R) are subgroups of Gr = GL2(R) x SLa(R) and thus act on Wr by restriction. If 
(^4, B) is an element of Wr and g(x, y) = 4 ■ det(Ar — By) is its cubic resolvent form, then the quantities 



I(A,B) 
J(A,B) 
a(A, B) 



= 1(9), 

= J (a), 

= a(9) 



(49) 



are invariants for the action of -Hr,i on Wr. We define the height of (A, B) by 

H(A, B) := H(g) = max(|/| 3 , J 2 /4). 



If (Q,(C,x)) is the pair corresponding to the i/z^-equivalence class of (A, B) £ W% >n , then we 
define the invariants I, J, and a of (Q,(C,x)) by I{Q,(C,x)) := I(A,B), J(Q,(C,x)) := J(A, B), and 
a(Q, (C, x)) := a(A, B) = n. We then have H(Q, (C, a;)) := H(C, x) = H(A, B). 

For i = 0, 1, and 2, let W^X (resp. W%\) denote the set of elements (A, B) in Wr (resp. W-£) such 
that det(A) > and A and B have 4 — 2i common real zeros in P 2 . In the remainder of this section we prove 
the following result, which by Proposition 14.91 implies Theorem [ 



Proposition 4.10 For an Hz,\-invariant set S C W%, let N(S; X, S) denote the number of ' Hz,i- equivalence 
classes of absolutely irreducible elements (A,B) £ S such that H(A,B) < X and < a(A,B) < X s . Then 
for any positive constant 8 < 1/4, we have 

(a) N(W£1;X,6) = C(2)C(3) x5/6 + 2S/3 + q(x5/6 + 2S/ 3) . 



4--) 



(b) 



N(W^l;X,6) = 8 C(2)C(3) x5/6 + 2S/3 + q(x5/6 + M/8) , 



45 



(c) A^W^JM) = 3 C(2)C(3) x 5 /6 + 2a/3 + o(x 5/ 6 + 2^ 



4.2.2 Reduction theory 

We first develop the necessary reduction theory in order to establish fundamental domains for the action of 
Hz,i on Wr i+ . We start by describing a fundamental set for the action of Hr^ on Wr,+. 



First, for i = 0,1, and 2, let W„ 

A*) 



« 



W 



(i) 



U W 



R,[3]' 



where W„ , -, denotes the set of elements 



(A, £?) in Wr' + such that the signature of A is equal to j. Let Wr denote the union UjW^ 1 ?.-,. (Note that 
every ternary quadratic form A over R that has signature — 1 is equivalent to the form Ai , where A± again 
denotes the first matrix in the pair of matrices in (|29p .) 

The results of §3.2, together with the construction in i j2.ll of fundamental sets for the action of 



GL 2 (R) on imply that for i = 0, 1, 2+, and 2- 



on W„ 



(i) 



(i) 

the set Lyjf is a fundamental set for the action of Hr 



[-1] 



where 



r(0 . 



1/2 



1/2 



a 6/2 
6/2 c d/2 
d/2 e 



6x 3 y 



2 2 
ex y 



<Ary 3 + ey £ L 



We now describe a fundamental set £^ for the action of Hr on Wr raj. If (A, B) £ Wr m, then 
the signature of A is equal to 3 and A is a positive definite ternary quadratic form. In particular, A and 
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B do not have any common real zero in P 2 and so [3] C W^ 2 ^. As A is positive definite, there exists 
7 G £Zr such that 7 • (A, B) = (id, B'). Furthermore, using the spectral theorem for real symmetric matrices, 
we may assume that 7 is chosen so that B' is a diagonal matrix. It follows that if (A, B) is a point in 
WrJ3], then there exists a point in Wr^j that is i?R-equivalent to (A, B) and has height equal to 1. Thus, a 

fundamental set L^ may be constructed by choosing one point (A, B) for each (I, J) such that H (I, J) = 1 
and 4/ 3 — J 2 > 0. We choose 



-(2#) 



1 

4i73 



,(■>) 



.(■0 



for our fundamental set for the action of Hw on Wn 



equation x 



J_ 

27 



0. 



Hence £w\ and L 



-(2) 



-(2+) 



K,[3]: 



U U L 



„( J ) 



where the r„- J ^ are the three real roots of the cubic 



-2 < J < 2 



(2#) 
V7 



form fundamental sets for the action of Ho on 



W-6l_,Wvl_ and wi 2 l, respectively. As usual, the key fact about the that we will need is that the 



,+ ' " R,+ " R,+ > 

absolute values of all the entries in the pair of matrices in the lS are uniformly bounded. 
We also require the following lemma. 



Lemma 4.11 Let (A,B) be any point in having nonzero discriminant. Then the order of the stabilizer 
of {A, B) in H-r is 4 if i = or 2, and 2 if i = 1. 

Proof: When i = 0, 1, 2+, or 2—, the result follows from Lemma |2~21 It is now enough to check the result 
for points in L\^ , where it is elementary. □ 

We now construct a fundamental domain F2 for the action of on H^. We choose T<z to be 
contained in the Siegel set N'A'KA, where 

K\ = {orthogonal transformations in He}; 
A 1 = {(t, si,s 2 ) : t > 0, s 2 ,s 2 > ci}, 



t 



where, (t, si, s 2 ) 
N' = {n(ui,u 2 ,U3,Ui) : |iti|, |u 2 |, \u 3 \, |u 4 | < c 2 } 

where, n{u\, u 2 , U3, M4) = 
A ={A:A>0}, 
where, A 



SlS 2 



s\sl 




1 




A 



A 



1 



1 



1 



and Ci, c 2 are absolute constants. We write J" 2 = J 7 ! x Jsl 3 C GL 2 (R) x SL 3 ( 



Let rij denote the size of the stabilizer in of an absolutely irreducible point in W-^ for i = 0, 1, 2; 



by Lemma 14.111 thus hq = 4, n± = 2, n 2 = 4. Then, for any fixed i, the multiset J-" 2 L^ is essentially the 

union of fundamental domains for the action of Hz,i on . More precisely, an i?z,i- equivalence x is 
represented in this multiset m(x) times, where m(x) = #Stab# R (a;)/#Stabij 2 x (a;) . Using the methods of 
the proof of Lemma T2.4[ it is easy to see that the stabilizer in Hz.i of an absolutely irreducible element is 
always trivial. We conclude that the product rii ■ N(W% ;X, S) is exactly equal to the number of absolutely 
irreducible integer points p 6 .F 2 L^ for which H{p) < X and < a(p) < X s . 
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4.2.3 Estimates on reducibility 

By our construction of the Lyy> the resolvent binary cubic forms of points in Lyy lie in for i = 0,2, 

and in L$ for i = 1. As in ^TOl we consider the set n x (L^;5) := T ( 2 A) L$ n {p : |F(p)| < X}, where 

J 7 ^ C .F 2 consists of those elements (71, 72) € .F 2 that satisfy 71 € J 7 } 5 ''. We shall need the following lemma 
whose proof is deferred to M4.6I 

Lemma 4.12 Let S < 1/4 be a positive constant and let g be any element in a compact subset Hq of Hr. 
The number of points (A,B) G gL^ fl Wz £/ia£ are not absolutely irreducible and satisfy an 7^ 0, 
H(A,B) < X, and < a(^4,-B) < X s is o(A 5 / 6 + 2 / 35 ), where the implied constant depends only on Ho. 

4.2.4 Averaging and cutting off the cusp 

If -y = (71,72) is an element of TJr, then we define the determinant of 7 by det(7) := det(7i). Let Hq be a 
K\- invariant compact set in that is the closure of some nonempty open set, such that every element in 

(i) 

Hq has determinant greater than 1. Let S C W% be an iJz i-invariant set and let 5' lrr denote the subset of 
absolutely irreducible elements in S. Then we have 

J #{x G 5 fa n F 2 hL§ : H(x) < A, < a(x) < X s }dh 
JM(b;X,d) - j. — . 

Just as in §2.3, we see that if C^ g = rii ■ J heHo dh, then we have 

N{S- A, 5) = -L- f #{x G S' irr n .gi?o^w : < A, < a{x) < X s } dg. (50) 

Since #0 is K\- invariant, we see that the right hand side in (I5T))) is equal to 
~4r [ #\x e Sennit, s 1 ,s 2 )\HQL ( $:H{x)<X,0<a(x)<X s )t- 2 s7 6 s~ 6 dnd x td x sd x \. (51) 

We define B(n, (t, s\, S2), A, A, 0") by 

B(n, (t, si, s 2 ), A, A, 5) := jx G n (t, si, s 2 ) A-ff .L$ : < A, < a(i) < A 5 } . (52) 

Then 

N(S;X,S) = -Lr j #{x G S' irr nB(n,(t,si,S2),A,A,(5)}r 2 sr 6 ^ 6 ^a' x trf x srf x A. 

Let Ci and C2 be constants that bound the value of a(A, B) for all (A, B) G HqL^J from above and 
below, respectively. Let C3 be a constant such that 1/C| 2 bounds the height of all the the points in HqL^ 
from above. We then have the following proposition whose proof is identical to that of Proposition 14.61 

Proposition 4.13 If either < 1, ° 2 X > A" 5 / 3 , or C 3 A < 1, i/ien i/iere are no absolutely irreducible 
points x in W% n B(n, (t, si, s 2 ), A, A, S). 

We also have the following lemma whose proof we omit as it exactly follows that of [5j Lemma 11]. 

Lemma 4.14 Let h be an element of Hq. Then the number of absolutely irreducible elements (A,B) G 
T^hL n Wz such that an = and H(A, B) < X is 0(A 5 / 6+<5 / 3 ). 



38 



Thus, to prove Proposition 14.101 we may assume that an 7^ 0. If B(n, a(t, Si, S2), A, X, S) contains 
an absolutely irreducible point (^4, B) £ W% with a n 7^ 0, then | Sl ^ 2 \ — O(l). From Proposition 14.131 we 
see that sf = 0(X S ^ 3 ) and s 2 . = 0(X S ^ 3 ). Equation (|51j) and Proposition 12 . 51 now imply that up to an error 
of o(X 5 / 6+2 ^ 3 ) coming from LemmaEHl N(W^ ; X, 6) is equal to 

/ 1/3 / / / (Vo\(B(n,(t,s 1 ,s 2 ),\,X,8)) + 0(\ 11 tsfsl))sf\s2 6 t- 2 dnd x td x sd x \, 

CY,' J\=C Jt=Zi TT ^Js 1 =l/2 Js 2 =1/2Jn' 

-Ho X s / 3 ' 1 

(53) 

where k\ and k 2 are absolute constants. The integral of the error term in ([53| is evaluated to be 0(X 5 ^ 6 + s / 3 ) 
while that of the main term is equal to 



f-OO f-OO p 

Vol(1l x (L§;S))-^r / 1/s / / \ 12 S ^s^t- 2 dnd x td x sd x X 

Cw J\=c Jt=%j^ Jktxs/^ Jk 2 xw Jn> 



= Vo\(TZ x (L^;S)) + 0(X 5 / 6 ). 
Therefore, to prove Proposition 14. 101 it remains to compute the volume of lZx(L^ ; 6). 
4.2.5 Computing the volume 

Define the usual subgroups N, A, N, and A of Hr as follows: 

N = {n(x) : x E M 4 }, where n(x) = (( 1 ^ ) , [ 1 x 4 ) ) ; (54) 

• 1 = !('•/. •/•-) : ' = R$ 4 }, *iv«|l|= I ( ' . j . I I I : (55) 

tr 1 t 2 - : 

AT = : it € M 4 }, where n(u) = f ( ] 1 ( u 2 1 ) | ; (56) 

A = {A e where A = ( ( A A ) , | 1 | | . (57) 

It is well-known that the natural product map JVx4xAfxA-> £Zr maps bijectively to a full measure set 
in Hr. This decomposition gives a Haar measure dg on iJg, namely, dg — t~ 2 du\d x td x \dh, where dh is the 
Haar measure on SL3(R) given by dh — £f t 2 2 dxd x tid x t2du. 

Fix i to be one of 0, 1, 2+, 2—, or 2#. Let L := and n := rij. Write L as a union L = L 1 U i J , 
where L 7 contains the points where J = ±1 and L J contains the points where / = ±1. These subsets again 
intersect only in finitely many points. Let W 1 and W J denote the sets • L 1 and Hy& ■ L J , respectively. 

Let dl and dJ be the measures on L 1 and L J , respectively, and let dv be the usual Euclidean volume 
on Wr. Then we have the following proposition whose proof is identical to that of Proposition 12. 71 

Proposition 4.15 Let f be any measurable function on Wr. Then 

(det h) w f(h ■ (A, B))dhdl = [ f(v)dv; 
(A.B)eL 1 Jhem Jw 1 




(b) i- / / (det h) 10 f(h ■ (A, B))dhdJ — f f(v)di 

971 J(A,B)£L J Jh£H m JW J 



3!) 



We now compute the volume of lZx(L;5). First assume that L is L w ' , L K W , L w , or L w ' . If 
7 € ffig has determinant equal to A 2 , then if (7 • w) = A 12 for v £ L. Therefore, by Proposition ^. 15[ we have 

Vol(ft x (L;$)) = |c(2)C(3) / / / \ 12 r 2 d x td x Ad J = 4C ™ 3) (X 5 / 6 + 2a / 3 - X 5 / 6 ). 

Similarly, if L = £^ , then by Proposition 14. 151 the volume of lZx(L; 5) is equal to 

f [ X f X \ 12 t- 2 d x td x \dJ + 2-U{2)({3) f 1 f X ( X X 12 t- 2 d x td x XdI 

J J=-2 J A=0 Jt=\/X s /3 3 ■//=-! J\=0 Jt=X/X s / s 



9 .'j=-2 A=o Jt=\/X s / S 45 



|C(2)C(3) 



yielding 



/j^-5/6 + 2(5/3 _ _X"5/6\ 

V 45 15 / 45 

Since n, ; = 4 for i = 0, 2+, 2—, 2# and rii = 2, we obtain Proposition 14. 101 



4.3 Congruence conditions 

For any set S in Uz (resp. Wz) that is definable by congruence conditions, we denote the p-adic density of 
the p-adic closure of 5* in Uz p (resp. Wg, ) by /j, p (S), where we normalize the additive measure /i p on U% p 

(resp. Wz p ) so that n P (Uz p ) = 1 (resp. /i p (W Zp ) = 1). 

We then have the following propositions whose proofs are identical to that of Theorem 12.111 

Proposition 4.16 Suppose S is a subset of U% defined by finitely many congruence conditions. Then 

N(S; X, 5) = N(uj?>;X, S) [] fx p (S) + 0(X 5 / 6 ). (58) 

v 

Proposition 4.17 Suppose S is a subset ofW% defined by finitely many congruence conditions. Then 

N(S;X,S) = N(W^;X,S)Y[^ p (S)+o(X 5 / 6 + 2S / 3 ). (59) 

p 



4.4 Values of p-adic densities 

In order to count the number of 2-torsion elements in the class groups of submonogenized cubic fields of 
bounded height and index, we need to count i^.i-equivalence classes of maximal elements in Uz of similarly 
bounded height and index, where an element in U% is said to be maximal if the corresponding cubic ring 
is maximal. We also need to count iJz,i-equivalence classes of strongly maximal elements in W% having 
bounded height and index, where a strongly maximal element of Wz is one whose associated binary cubic 
resolvent form in Uz is maximal. 

We say that an element g G Uz is maximal at p if the corresponding cubic ring is maximal at p. 
Similarly, we say that an element ( A, B) € W% is strongly maximal at p if its associated cubic resolvent ring 
is maximal at p. Then an element g £ Uz (resp. (A, B) € Wz) is maximal (resp. strongly maximal) if and 
only if it is maximal (resp. strongly maximal) at all p. 

We denote the set of elements in Uz that are maximal at p by U P {U) and the set of elements in Wz 
that are strongly maximal at p by V P (W). We then have the following lemma (see [9], [4]). 

Lemma 4.18 We have 

V P (U p (U)) = (p 3 -l)(p 2 -l)/p 5 ; 
V P (V p (W)) = (p 3 ~l) 2 (p 2 -l) 2 / P w . 
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4.5 Uniformity estimates 

We require the following two uniformity estimates in order to carry out the sieve step in the proof of 
Theorem OUl 

Proposition 4.19 Let 5 < 1/4 be a positive constant and let W P (U) denote the set of all binary cubic forms 
that correspond to cubic rings not maximal at p. Then N(Wp(U);X,5) = 0{X 5 / 6 + 2S ^/p 1 + 4S ^), where the 
implied constant is independent of p. 

Proof: Let p be a fixed prime and assume that p = X a . We see from [9j Lemma 9] that a cubic ring C can 
fail to be maximal at p in two possible ways: Either C has a Z-basis (l,ui,0) such that C — Z+Z-(u>/p) + 'Z-9 
forms a cubic ring or has a Z-basis (1,uj,9) such that C" = Z + Z • (oj/p) + Z ■ (9/p) forms a cubic ring. 
Now H(C',x) — H(C,x)/p 2 and H(C",x) — H(C,x)/p 4 . Furthermore, as a(C,x) is equal to the index of 
Z[x] in C, it follows that a(C',x) = p ■ a(C,x) and that a(C",x) — p 2 ■ a(C,x). Thus, identically as in [5J 
Proposition 20], we see that 

N(W P (U);X, 6) = 0(N(U z ; X x ~ 2a , 5 + a) + N(U Z ; X^ Aa , 5 + 2a)). 

The result now follows from Proposition [473] □ 

Proposition 4.20 Let 5 < 1/4 be a positive constant and let YV P (W) denote the set of all (A, B) £ W% that 
are not strongly maximal at p. Then N(W P ;X,S) = 0(X 5 ^ 6 + 2S ^ 3 /p 1 + 4S ^ 3 ) where the implied constant is 
independent of p. 

Proof: Let Wp 1] {W) be the set of all (A, B) € W% that are nonmaximal at p and let Wp 2) {W) be the 
set of all (A, B) £ Wz that are maximal at p but also overramified at p. A cubic resolvent ring of a 
quartic ring Q is maximal if and only if Q is maximal and not overramified. Thus N(W P (W); X, 5) = 
N(W^ ] (W);X, 6)+N(W P [2) (W);X,5). 

Now suppose that p is equal to X a . Then an argument identical to the proof of (5j Proposition 23] 
shows that we have 

iV(wW(W0;JM) = 0{N{W z - X 1 - 2a ,6 + a)) = 0(X ( - 1 - 2a ^i+i^ s+a ^) = 0{xi+i 5 - a -^ aS ), 

N(W p 2) (W);X, S) = 0{N{Wr, X 1 ~ 2a , 8 + a)) = 0(X (1 - 2a '>^+^ s+a ^) = 0{xi + i s - a -^ a5 ), 
yielding the proposition. □ 

The results of this section also allow us now to finally prove the key uniformity estimate for binary 
quartic forms that was used in Section 3, namely, Proposition 13.181 This will complete the proof of Theo- 
rem [OJJ We note that Proposition 13 . 1 81 will also prove a key role in the proofs of Theorems 1 1 . 1 1 and 1 1 . 31 

Proof of Proposition 13.181 Let Wp 1 ^ (V) be the set of all binary quartic forms f £ V% that are nonmaximal 

(2) 

at p and let W p (V) be the set of all forms / € W% that are maximal at p but also overramified at p. Clearly 
N(W P (V);X) = NiW^iV^X) + N(W^ 2 \V);X). Then, as in [5, Proposition 23], we have 

N(yV^\V);X) =0{N{Wz; X 1 - 20 , a)) = 0(X^~ 2a ^ + ^) = 0(Xs- Q -^ 2 ), 

NiW^iV)-^) ^ OiNiW^X 1 - 2 "^)) = 0{X {1 - 2a) ^+^) = 0{xi- a -^ a2 ). 

Consider the pair (Q, (C, a;)), where Q is a quartic ring and (C, x) is a submonogenized cubic resolvent ring 
of Q. The discriminant of the pair (Q, (C, x)) is less than twice the height of (Q, (C, x)) Therefore, the fact 
that a cubic ring has at most 12 monogenizations (see [3S] and [37]) combined with [SJ Proposition 23] also 
yields the estimate 

N(W P (V);X) = 0(X/p 2 ) = OiX 1 - 201 ). 
Combining the three estimates we obtain that N(W P (V);X) = 0{X 5 ^ 6 /p 1+2/3 ) 7 as desired. □ 
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4.6 Proof of Lemma 14.121 



The proof of this lemma is very similar to that of [SJ Lemma 13]. Recall that if an element (A,B) £ W% 
corresponding to the pair of rings (Q, C) is not absolutely irreducible, then Q is not an order in an S^-ficld. 
This implies that (4), (13), and (112) cannot all occur as splitting types of primes in Q. Indeed, if all three 
of these splitting types occur in Q, then Q must be a domain, and the Galois group G of the Galois closure 
of the quotient field of Q over Q must contain a 4-cycle, a 3-cycle, and a 2-cycle, implying that G = S4. 

Let us denote by T P (W,9) the set of elements in Wz whose splitting type over p is 9. It follows 
from [4j Lemma 23] that fA p (T p (W, (4))) approaches 1/4 as p — > 00. Therefore the total number of points 
(A, B) e F^hL^ having height less than X, an ^ 0, and such that the quartic ring associated to (A,B) 
does not have splitting type (4) over any finite prime p is 

0( lim n(l-ft(^W(4)F l+f )=o(I 5/6+35/3 ). 

p<JV 

Similarly, the total number of points (A, B) £ F^hLyy having height less than X, an ^ in J 7 ^ hLy^ , and 
such that (A, B) does not have splitting type (112) (resp. (13)) is also o(X 5 ^ 6 + 2(5 / 3 ). The lemma follows. 



4.7 Proof of the main theorem (Theorem 11.101) 

By Lemmail we have fJ,(U p (U) = {p 3 - l){p 2 - l)/p 5 and fi{V p (W) = {p 2 - l) 2 (p 3 - l) 2 /p 10 . Identically 
as in the proof of Theorem 11.91 we use Theorems 14.11 and 14.81 and Propositions 14.191 and 14.201 to obtain 
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lim 
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45 



\H(K)\<X 
Disc(K)>0 
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n 



KV P (W)) _ C(2)C(3)C(2)C(3) 



M(W P (C/) 



C(2) 2 C(3) 2 



where K ranges over isomorphism classes of submonogenized cubic fields having index at most X s . This 
proves the weaker version of Theorem 11.101 where we average over all submonogenized fields of index at most 
X s and height at most X , without any specified splitting conditions. 

To complete the proof of Theorem ll.lOl in all cases, we proceed as in £13.71 Let p be a fixed prime. For 
a cubic splitting type a, let U P (U, a) denote the set of integral binary cubic forms in U P (U) whose associated 
cubic ring has splitting type a over p. Similarly, for a quartic splitting type 8, let V P (W, 9) denote the set of 
elements in V P (W) whose associated quartic ring has splitting type 9 over p. Then it suffices to prove that 



H P {U P {U)) 



ii P {V p {W)) 



for all values of p and a. This equality follows from the computations in 
and is summarized in Table |4j we have proven Theorem 11.101 



Lemma 10] and [H Lemma 23], 
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1 


p 2 + p4 


1 






p 2 + p + 1 



Table 4: Demonstration of the equality of p-adic density ratios for U and W 



5 The mean size of the 2-Selmer group of elliptic curves 

Recall that every elliptic curve E over Q can be written in the form 

E(A,B) : y 2 = x 3 + Ax + B, (60) 

where A, B £ Z and p A \ A if p 6 \ B. For any elliptic curve E(A, B) over Q written in the form (|50)) . we 
define the quantities 1(E) and J(E) by 

1(E) := -3A, 
J(E) := -27B. 

In the introduction, we had defined the height of E(A, B) by 

H(E(A, B)) = max{4|A 3 |,27S 2 }. 
In this section, we shall work with the slightly different height H'(E) defined by 

H'(E) :=mnx(\I(E)\ 3 1 J(E) 2 /A). 

Note that H and H' only differ by a constant factor; namely, for every elliptic curve E over Q we have 
27 H(E) = 4H'(E). 

In this section, we prove Theorem ll.3l bv computing the average size of the 2-Selmer group of rational 
elliptic curves when these curves are ordered by their heights (note that the two heights H and H' give the 
same ordering on every set of elliptic curves) . Theorem II. H being a special case of Theorem 11.31 will then 
follow. 

In fact, we prove a stronger statement than Theorem 11.31 To state this result, we need some 
notation. If F is a set of elliptic curves over Q defined by congruence conditions, then we denote by F lnv 
the set {(1(E), J (E)) : E € F}. We denote the p-adic closure of F inv in Z p x Z p by F" v . We say that 
such a set F of elliptic curves over Q is large at p if the set of all monogenized cubic rings over Z p having 
invariants equal to (/, J) G F™ v contains all the maximal monogenized cubic rings over Z p . We say that a 
set of elliptic curves F is large if it is large at all but finitely many primes p. In this section, we prove the 
following strengthening of Theorem 11.31 

Theorem 5.1 When elliptic curves E in any large family are ordered by height, the average size of the 
2-Selmer group 82(E) is 3. 
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Note that the set of all elliptic curves is large. So too is the set of elliptic curves E : y 2 = g{x) 
defined by finitely many congruence conditions on the coefficients of g. Thus Theorem If .31 indeed follows 
from Theorem 15. f I 

Remark on notation Unlike the introduction, we denote the elliptic curve E : y 2 = x 3 + Ax + B by 
E(A, B) in this section. Also, from hereon in, we denote the elliptic curve having invariants equal to I and 
J by E ItJ . 

5.1 Integral binary quartic forms and the 2-Selmer group of elliptic curves 

Recall that an element in the 2-Selmer group of an elliptic curve E/Q may be thought of as a locally soluble 
2-covering of E/Q. Moreover, a locally soluble 2-covering of E/Q that is defined over Q has a degree 2 
divisor defined over Q, thus giving a binary quartic form over Q well-defined up to GL2(Q)-equivalence. 

We say that two binary quartic forms fi and ji with coefficients in a field K are K-equivalent if 
there exists /i G K and 7 G GI^-ftT) such that /1 = fJ. 2 ^ ■ $2)- For ease of notation, if two binary quartic 
forms over Q are Q-equivalent, then we simply say that they are equivalent. We then have the following 
lemma (see [12l Lemma 2] and the discussion following it). 

Lemma 5.2 Let E be an elliptic curve defined over Q. If 9 : C — > E is a locally soluble 2-covering in G, 
then there is a model for C of the form 

z 2 = f(x) = ax 4 + bx 3 + cx 2 + dx + e (61) 

where a,b,c,d,e € Q. Moreover, the binary quartic form F(x,y) = ax 4 + bx 3 y + cx 2 y 2 + dxy 3 + ey A has 
invariants equal to \ 4 I{E) and X 6 J(E) for some A G Q. Finally, two 2-coverings C\ and C2 of E represented 
by f\(x) and f2(x), respectively, give isomorphic 2-coverings of E if and only if F\(x,y) and F2(x,y) are 
equivalent. 

We say that a binary quartic form / G Vq is locally soluble if the equation z 2 — f{x, y) has solutions 
in K and in Q p for all primes p. Since every binary quartic form with rational coefficients is clearly equivalent 
to one with integer coefficients, Lemma 15.21 implies that if E/Q is an elliptic curve having invariants / and J, 
then the set of elements in the 2-Selmer group of E is in bijective correspondence with the set of equivalence 
classes of locally soluble integral binary quartic forms having invariants A 4 / and A 6 J for some A G Q. 

We now recall the following facts which are Lemmas 3, 4, and 5 of [T2] . 

Lemma 5.3 Let f be an integral binary quartic form having invariants I and J. Let p > 5 be a prime such 
that p 4 \ I, p e \ J, and the equation z 2 = f(x,y) has a solution overQ p . Then there exists an integral binary 
quartic form f equivalent to f having invariants p i and p~~ 6 J . 

Lemma 5.4 Let f be an integral binary quartic form having invariants I and J. Suppose that 3 5 | /, 3 9 | J 
and z 2 = f(x,y) has a solution in Q3. Then there exists an integral binary quartic form f equivalent to f 
having invariants 3 -4 / and 3 -6 J. 

Lemma 5.5 Let f be an integral binary quartic form having invariants I and J. Suppose that 2 6 | /, 2 9 | J , 
2 10 I (87 + J), and z 2 = g(x,y) has a solution in Q2. Then there exists an integral binary quartic form f 
equivalent to f having invariants 2~ 4 I and 2~ 6 J. 

We may use these lemmas to prove the following theorem. 

Theorem 5.6 Let E be a fixed elliptic curve over Q having nonzero discriminant and invariants equal to 
I and J. Then the elements of the 2-Selmer group of E are in one-to-one correspondence with PGL2(Q)- 
equivalence classes of locally soluble integral binary quartic forms having invariants equal to 2 1 and 2 6 J. 

Proof: Let / be a locally soluble integral binary quartic form having invariants equal to A 4 / and A 6 J for 
some A G Q. We first prove the existence of an integral binary quartic form /' equivalent to / such that /' 
has invariants equal to 2 4 J and 2 6 J. 
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We find such an /' as follows. Firstly, note we may assume that A € Z and that 2 divides A. Indeed, 
if A = p/q with p, q £ Z, then we may replace / by 4q 2 /. Secondly, by Lemma |5.3[ we may assume that no 
prime p > 5 divides A. Now if 3 fe divides A for some integer k > 1, then since I = 1(E) is divisible by 3 and 
J = J(E) is divisible by 27, we see that 3 5 divides /(/) = A 4 / and 3 9 divides J(g) = A 6 J. Therefore, we may 
repeatedly use Lemma T5. 41 to assure that 3 does not divide A. Finally, if 4 divides A, then 2 8 divides I and 
2 12 divides J. Now, repeated use of Lemma 15.51 yields an integral binary quartic form /', having invariants 
equal to 2 4 / and 2 6 J, that is equivalent to /. 

Therefore, by Lemma 15.21 the set of elements in the 2-Selmer group of E are in one to one corre- 
spondence with equivalence classes of locally soluble integral binary quartic forms having invariants equal to 
2 4 I(E) and 2 6 J(E). Theorem 15.61 now follows from the observation that two integral binary quartic forms 
having the same invariants are equivalent if and only if they are PGL2(Q)-equivalent (if g = /i 2 (7 • /), then 
I( g ) = M 4 (det7) 4 and 1(g) = ^ 6 (det7) 6 and so y 2 = l/(det 7 ) 2 . Thus / and g are PGL 2 (Q)-equivalent.) □ 

An elliptic curve E : y 2 — x + Ax + B over Q has a non-trivial 2-torsion point defined over Q if 
and only if the corresponding cubic equation x 3 + Ax + B has a rational root. We have the following two 
propositions; the first follows immediately from Lemma 13.31 while the second follows from Lemma 13.211 and 
Theorem 15.61 

Proposition 5.7 The number of elliptic curves E over Q such that E has a non-trivial rational 2-torsion 
point and H(E) < X is 0(X 1 / 2+e ). 

Proposition 5.8 The total number of elements in the union of the 2-Selmer groups of all elliptic curves 
over Q having a nontrivial rational 2-torsion point and height bounded by X is 0(X 3 ^ 4+€ ). 

We say that an elliptic curve E over a field K is rigid if 1(E) ^ and J(E) ^ 0; rigid elliptic 
curves over A are precisely those elliptic curves that have no extra automorphisms over A (see e.g., |40[ 
Thm. 10.1]). Counting just those points with I = or J = in the proofs of Proposition 12 . 101 and Theorem 
12 . H we easily obtain the following proposition: 

Proposition 5.9 The number of non-rigid elliptic curves over Q having height bounded by X is 0(X 1 ^ 2 ). 
The total number of elements in the union of the 2-Selmer groups of non-rigid elliptic curves having height 
bounded by X is 0(X 3 / i ). 

We will see that the total number of elliptic curves having height bounded by X in any large family 
is ^> X 5 / 6 . Thus, for the purposes of proving Theorem 15.11 it suffices to consider only those elliptic curves 
over Q that are rigid and have no nontrivial rational 2-torsion point. 

5.2 Computations of p-adic densities in terms of local masses 

Suppose if is a field and E is an elliptic curve defined over K . As in the special case K = Q discussed in the 
introduction, the A"-isomorphism classes of soluble 2-coverings C of E correspond bijectively to elements of 
E(K) /2E(K). These soluble 2-coverings C then possess a degree two line bundle over K, thereby yielding 
a double cover of P : (A) ramified at four points. We thus obtain a AT-soluble binary quartic form with 
coefficients in A, where a binary quartic form / over A is called K-soluble if the equation z 2 — f(x, y) has 
a solution in A. This yields the following lemma. 

Lemma 5.10 Let E be an elliptic curve over a field A having invariants 1^0 and J ^ 0. Then there 
exists a natural map 

Qe ■ E(K)/2E(K) — > {K -equivalence classes of quartics}. 

This map is injective and the image consists exactly of the K-soluble K -equivalence classes of quartics having 
invariants equal to I and J. 
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See, e.g., [TH1 Proposition 2.2] for an explicit construction of Qe when K has charactaristic not 2 or 3. 

Now a 2-covering C of an elliptic curve E over if can be viewed as a principal homogeneous space 
for E/K. The curves E and C are isomorphic over K and the automorphisms of C over K correspond to the 
2-torsion points of E(K). In fact, a 2-torsion point of E(K) yields an automorphism of C over K simply via 
the action of E on C . Moreover, this automorphism is defined over K if and only if the 2-torsion point in 
question is itself defined over K . In conjunction with Lemma 15.101 the above discussion yields the following 
lemma. 

Lemma 5.11 Let E be an elliptic curve defined over a field K having nonzero invariants. If a G E(K)/2E(K) 
corresponds to the K -equivalence class of the binary quartic form f under the correspondence of Lemma \5.10[ 
then the size of the stabilizer of f in PGL2(-RT) is equal to #E(K)[2], the number of 2-torsion elements in 
E defined over K . 



We now turn to Theorem 15.61 which asserts that elements that are not the identity in the 2-Selmer 
group of an elliptic curve E] t j over Q are in bijective correspondence with PGL2(Q)-equivalence classes of 
locally soluble irreducible integral binary quartic forms having invariants 2 4 / and 2 6 J. In §2, we computed 
the number of GL2(Z)-equivalence classes of integral binary quartic forms having bounded height. In order 
to use the latter result to compute the number of PGL2(Q)-orbits of integral binary quartic forms, we 
construct a set of integral binary quartic forms by choosing, from each such PGL2(Q)-orbit, one GL2(Z)- 
orbit of integral binary quartic forms. In the remainder of this section, we compute the p-adic density of the 
union of these GL2(Z)-orbits. 

To this end, for any (/, J) € Zx Z, let C\, . . . , Cfc be a set of PGL2(Z)-orbits on the set of locally 
soluble integral binary quartic forms having invariants equal to / and J. Let , . . . , be a maximal set 
of PGL2(Q)-inequivalent classes among the C\, . . . , Ck, and define S I,J to be the union of all the integral 
binary quartic forms in the CV over all j. Define S to be the union of S 2 7,2 J over all (/, J) € F lnv . We 
now determine the p-adic density /j, p (S F ) of the p-adic closure of S F in Vz in terms of a local (p-adic) mass 
M p (V, F) of all isomorphism classes of soluble 2-coverings of elliptic curves over Q p : 

Proposition 5.12 We have n P (S F ) = |2 10 /3 3 | p • M p (V,F), where 

M,(v;i0:=(i-i) / E 

V V J J /,J)GF'° V ^jp. i. 



-dldJ. 



Proof: Let F™ v again denote the p-adic closure of F mv in Z p x Z p , and let S p be the p-adic closure of 
S F in Vz p . For (/, J) e F p nv , let B ! ' J consist of a set of representatives fx, ■ ■ ■ , fk € Vz p for the action of 
PGL2(Q P ) on the set of soluble binary quartic forms in V% having invariants equal to 2 4 I and 2 6 J. Define 
B F by ' 

B F := |J PGL 2 (Z p ) • /. 

(/,./)e-F; nv 

The p-adic density n v {S F ) can be determined from the set B F ; namely, we have 

dv= [ 1 df 

ves£ JfeBF #Aut(/) 

where Aut(/) denotes the stabilizer of / in PGL2(Q P ). The latter integral can be computed using a Jacobian 
change of variables; indeed, Proposition [278] and the principle of permanence of identities imply that 

/ -H- a = |2l ° ' 2/27| P Vol(PGL 2 (Z,)) / E inm - 

#Aut(/) J(i,J)eF- r t *-t j #Aut(/) 

Note that we have the factor 2 10 • 2/27 instead of 2/27 because / € B I,J has invariants 2 4 / and 2 6 J as 
opposed to / and J. 
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By Lemmas 15.101 and 15.111 elements / <E -Bp" 7 correspond bijectively with elements a E E] j/2Ei^j, 
and the cardinality of Aut(/) is equal to the cardinality of 2?j j(Q p )[2] whenever 1^0 and J ^ 0. Proposition 
15. 121 now follows because the volume of PGL 2 (Z p ) with respect to the Haar measure obtained from the NAN 
decomposition of PGL 2 (Q p ) is equal to (1 — l/p 2 ) for p > 3 and to 2(1 — 1/2 2 ) for p = 2 (along with the 
fact that the measure of the set {(0, J)} U {(I, 0)} C F inv is equal to 0). □ 

In the analogous manner, if F is a large set of elliptic curves, then we may define M p (Ui,F) to be 
the measure of with respect to the measure dl dJ on Z p x Z p , where the measure dl dJ is normalized 
so that Z p x Z p has measure 1. That is, we have 



5.3 Uniformity estimates 

In this subsection, we prove a uniformity estimate that, analogous to the proofs of Theorems 11.91 and 11.101 
allows us to carry out the sieve step in the proof of Theorem 15. II 

We say that an element / € Vz is bad at p if either / is not Q p -soluble or there exists an element 
/' G Vz such that / is PGL 2 (Qp)-equivalent to /' but not PGL 2 (Z p )-equivalent to /'. Then we have the 
following proposition: 

Proposition 5.13 The number of 'PGLi2(Z)- equivalence classes of elements in V% that are bad at p and have 
height less than X is 0(X 5 / 6 /p 5//3 ) 7 where the implied constant is independent of p. 

Proof: Let p > 5 be a fixed prime. We show that if an element / € V% is bad at p, then / is not strongly 
maximal at p. Then Proposition 13. 181 (proved in ij4.5p will imply the result. 

If /, /' € Vz are PGL 2 (Qp)-equivalent but not PGL 2 (Zp)-equivalent, then by replacing / with a 

form that is PGL 2 (Z p )-equivalent to it, we may assume that / = 7 • /', where 7 = ( p p _„ ). It is then clear 
that / is not maximal, and therefore not strongly maximal, at p. 

Next, we show that if / £ Vz is not Qp-soluble, then it is again not strongly maximal at p. First, if 
the discriminant of / € Vz p is prime to p, then / is Qp-soluble (see [211 Chapter 3.6]). Also, if the splitting 
type of / at p is (1 2 H) or (1 3 1), then the reduction of / modulo p has a single root in P 1 (F p ), which lifts to 
a root in P 1 (Q P ) by Hensel's Lemma. Thus / is Qp-soluble. If the splitting type of / over p is (2 2 ), (1 2 1 2 ), 
or (l 4 ), then / is overramified and is therefore not strongly maximal. 

Thus it suffices to prove that if the splitting type of / at p is (1 2 2), then / is (Qp-soluble. If / G Vz p has 
splitting type (1 2 2), then the reduction of / modulo p can be assumed to be of the form ax 2 (x 2 — hy 2 ) , where 
n is a nonresidue modulo p. Hence we may assume that / is PGL 2 (Q p )-equivalent to a(x 2 — kpy 2 )(x 2 — ny 2 ), 
where a,n,k € Z p , the element n £E Z p is a nonresidue when reduced modulo p, and p \ a. If a is a square in 
Qp, then /(l, 0) is a square in Q p and we are done. So we may assume that a is a nonsquare. Now if p \ x, 
then x 2 — kp is a square in Q p ; so it suffices to prove the existence of xo £ F* such that Xq — n is a nonresidue 
modulo p. Suppose this is not the case, and denote the residues in Fp by ri, . . . ,rt, where t = (p — l)/2. If 
for all i we have that rj — n is a residue, then by the pigeonhole principle there exists c < I + 1 and a residue 
r € such that r — cn = r, which is a contradiction. □ 

5.4 Proofs of the main theorems (Theorems II. 1L II. 3L and 15.11) 

We prove the following theorem, from which Theorem 15.11 will be seen to follow. 
Theorem 5.14 Let F be a large set of elliptic curves. Then we have 




did J. 



(62) 



£ (#S 2 (E)-1) 



lim 

X^oo 



EeF 

H'(E)<X 



C(2)II 



M P (V,F) 
M P (U U FY 



(63) 




1 



p 



EG-F inv 
H'(E)<X 
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Proof: We begin by counting the number of elliptic curves in F that have bounded height in terms of the 
3 



volumes of the sets which were defined in the proof of Proposition 12.101 by 



R+ = e K 2 : \i\ < X 1 / 3 , \j\ < 2X 1 ' 2 , 4z 3 - f > 0}, 

R~ x = e R 2 : |*| < X 1 / 3 , \j\ < 2X 1 '\Ai i - f < 0}. 

Lemma 5.15 With the notation of Theorem \5. 14\ we have 

l = [Vol(R+) + Vol(Rl)]-l[M p (U 1 ,F)-X 5 / 6 + o(X 5 ^), (64) 
EeF p 

H'{E)<X 

Proof: It was proved in Proposition ^. lOl that the total number of points (I, J) € ZxZ such that H(I, J) < X 
is equal to Vol(iZi) + Vol(iZ^), up to an error of X l ' 2+e . From (f6"2"|) we see that the quantity M p {U\,F) is 
equal to the p-adic density of F p nv in Z p x Z p . The lemma now follows from Proposition 13. 1 71 along with the 
condition that F is a large set of elliptic curves. The proof is identical to that of Theorem II .91 in £)3.7I □ 

Now note that the numerator of the left hand side of the equation in Theorem 15.141 is equal to 
N(S F ; 2 12 X), the number of PGL2(Q)-equivalence classes of locally soluble irreducible integral binary quartic 
forms having invariants 2 A I and 2 6 J, such that (I, J) € F inv and H(2 i I, 2 6 J) < 2 12 X. By Propositions I5TT21 
and !5.13l we have, up to an error of o(X 5 / 6 ), that 

N(S F ;2^X) = N(V^UV^ +) UV^^X)-l[^S F ) 

p 

= N(V^ U Vf +) U V^ ] -2 12 X) ■ J] 1^/3% ■ M P (V,F) 



|C(2)2 10 



\ + voi(i?+) + l - ■ voi(i? r 



^ , (65) 

l[\2 10 /3\-M p (V,F)-X 5 ^ 



c(2)[voi(i?+) + voi(i?r)] • JJ ■ ^ 5/6 ; 



the proof is again identical to that of Theorem ll.9l Taking the ratio of (|65|) and (f64|) now yields Theorem l5.14l 

□ 

To evaluate the right hand side of (|63|) . we require the following fact (see [HI Lemma 3.1]): 
Lemma 5.16 Let E be an elliptic curve over Q p . XTien 

#E[2}(Q p ) ifp^2: 
2-#£[2](Q p ) ifp = 2; 
By Proposition 15.121 and (|62|) . the right hand side of (|63|) is equal to 



#(E(Q P )/2E(Q P )) = 



TT / d/dJ 

_ J(i.j}£Fi m 



'(/,J)6F» 

which is then equal to 2 by Lemma r5.16l We have proven Theorem 15 . II (and thus also Theorems 1 1.1 1 and 1 1.31) . 
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